Gene Saro_3233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3233 
Symbol 
ID3917491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3453763 
End bp3455184 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content64% 
IMG OID640446017 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_498502 
Protein GI87201245 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGTAA GCATCGTGAC GGTGCCGATC TATCTGCATG TCATCGGCGG CGAGCGCTAT 
GGCGCGCTGC TGATCGCCTG GCTTCTCCTC GGCTACTTCG GGCAGGCCGA TTTCGGCATC
GCCCGTGCGA TCACGCACCG GATCTCGGCC CTGGGACGGC AATCGCGCCA GCAGAATGCC
GAGACGCTGT GGTCGGGCAT TGCGGGCGTC ATGGTCTTCA GCCTGTTGAG CGCCCTCCTG
GTCTATGGAG CTTCCGGATA TTTCTTCTCC GGTCCGTTCA AGGTGGGCGA AAGCCTGCGG
GCCGAGATGA TGGCATCGCG GGTTGCGCTG GCGCTGTGCA TTCCGGTCAT TGCCATCACC
AGTGTCTTTG CGGGCGCCCT GATGGGGCTG GAGCGGTTCA AGCTGGTTTC GGCGGGCAAC
CTTGTCAGTT CCATCGCGAT GCAGGTCCTC CCCCTGCTCG TCGCGTACTA TGTCGGCAGC
AATCTCACCG GCCTTATCTT CGCGGCGCTT TTCGGCCGGG CGATCGGCCT TCTGATCGCG
GCGGGAGGTG CGTGGCTGAC GATGCTGCGA GGGCAGGCGA TATCGGTCTC CCTCGCCGAA
TTCCGCCACC TGTTCACGTT CGGCGCCTGG ATCCTGCTGA CGATGATCGT CGGACCCATG
ATGACGATGG CCGATCGTTT CGTGATCGGT GGCACGCTCG GGGCCACGGC GGTTGCGGTC
TATACCGTGC CGTTCCAGAT CGCGAGCCGC TCGGCCATGT TCCCCTATGC CGTATCGCAG
GTCCTTTTTC CGCGGTTCGC GTCCGACAGC GGGGAAAGAT CGCAGGAGCG ATGCCGCAGT
TCGACCGTGC TGATCGCGCA GGTCTACGCG CCAATGGTGA TCGGACTTTC CTGCCTCGCC
GCACCGCTGC TGCACTTGTG GATCGGCACA AAGCTTGATC CGCGATCCAT CCTCATCGGC
CAGATCGTGA TCACCGGATT CTGGGCCAAC GCAATCGCGG GTGTGCCTTA CGCTTATATC
CAGGCGCGGG GAAATCCGCG CTTTACCGCC CTGCTCCACG TGGCGGAGCT GCCATTCTAC
ACGGCTGCGC TGTATCTGCT GGGAATGAAC TACGGGCTCG CGGGCGTCGC CGCCGCCTTC
ACCTTGCGCT GCGCGGTCGA CTGCGTGCTG CTCATGGGAG CTGCGAAGTT GTGGACGCGC
GAGATGGCGG CACGGCTGGC GGGTCCTGTC GCGCTCGTGT TGCTTTCGAT TGTGGCCGGG
CAGATGTTCC AGGGCTGGGT CGGCGCATTC CTCGCTGCCT TCGTCCTTGG CGGGGCGGGC
GGCGTGCTCA TGCTTGTCCA AATGCCCGAC GAAGTCCGAT CCCAACTCGA CAAGACCGCG
CTCGCCCGCT TCCTGCCCCG CTGGACGGCG AAAGCGGCGT GA
 
Protein sequence
MVVSIVTVPI YLHVIGGERY GALLIAWLLL GYFGQADFGI ARAITHRISA LGRQSRQQNA 
ETLWSGIAGV MVFSLLSALL VYGASGYFFS GPFKVGESLR AEMMASRVAL ALCIPVIAIT
SVFAGALMGL ERFKLVSAGN LVSSIAMQVL PLLVAYYVGS NLTGLIFAAL FGRAIGLLIA
AGGAWLTMLR GQAISVSLAE FRHLFTFGAW ILLTMIVGPM MTMADRFVIG GTLGATAVAV
YTVPFQIASR SAMFPYAVSQ VLFPRFASDS GERSQERCRS STVLIAQVYA PMVIGLSCLA
APLLHLWIGT KLDPRSILIG QIVITGFWAN AIAGVPYAYI QARGNPRFTA LLHVAELPFY
TAALYLLGMN YGLAGVAAAF TLRCAVDCVL LMGAAKLWTR EMAARLAGPV ALVLLSIVAG
QMFQGWVGAF LAAFVLGGAG GVLMLVQMPD EVRSQLDKTA LARFLPRWTA KAA