Gene Saro_1589 details

Gene Information

Locus tag: Saro_1589
Symbol:
ID: 3918697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1647915
End bp: 1650911
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 64%
IMG OID: 640444329
Product: TonB-dependent receptor
Protein accession: YP_496863
Protein GI: 87199606
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
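
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1647915, 1650911)
print(length, protein_length(length))  # 2997 998
```

Both values agree with the Gene Length and Protein Length fields of this record.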


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.36828
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACCTC GTGAAGGGCT ACGACTCAAG TGGCCCGTGG AGGAGGATGA CATGCATATG 
AGGGGCACTC GCCTTGCAAT TTCCGCGTCC CTGCTGACCG TGAGCTTCGC GCTCGCAGCG
CCGGCGTTCG CACAGGAAGC AGCGCAGGAC AAGGCGGCCG AAGCGCCGGT CGACGCCAGG
GAAATCGTCG TTACCGGGTC GCTCATCCAG CGTCCCAACA ACACCGCTGT CAGCCCGATC
GTGAGCGTCG GCGAGGCCGC GCTCAAGGAG ACCGGCCAGG TCAACCTGCA GGACGCGCTG
AACCAGTTCC CGAGCTTCAC CACCGCCGGC AACGCCAACA CCGGCGGGCA AGGCACCGGC
GGTCGCGCGT CGATCAACCT TCATGGCCTC GGGACCAACC GCAACCTCGT GCTGCTCGAT
GGCAAGCGCC TGCCGCTGTC GGACATCAAC GGCAACGTTG ACATCAACAT CCTGCCGGAA
TCGATCATGG GCGGCGTTGA CGTGATCACA GGCGGCGCCT CGGCGATCTA TGGTTCGGAC
GCGATCTCGG GCGTGGTCAA CTTCAAGACC CTGCGCGCTT TCGACGGTGT GCGCCTCGAC
GCGCAGAACT CGATCAGCGA GCGCGGCGAC GGTTACAAGT TCAACGGCTC GCTCGCTTTC
GGTACCGGTT TCGCGGAAGA TCGCGGTCAC GTGATCGCGG CGTTCAGCTA TGCCCAACAG
GATGCAGTCA ACGGCAGCAG CCGCGACTTC TTCTTCGACA AGACGCCGTC TTCGTTCATC
GGCACGGGCA CGTTCGTGCC CAGCGCCACC AATGCCCCGA CGGTCGATGC CGTCAGCGGC
GTGTTCAGCG GCTATGGCGT CGGGGGCCTG CCGAACAACG TGAACCTGCT GAACCTCGGG
TTCAACAACG ACGGCACGCT GTTCACCCAG ACCGGTGCGC GCAACTACAA GGGCGCGAAC
GGCACCAACG GCTATCTCGT TGTCGGCAAC AACGTGCGCA TGCCGGTCGG GCAGCAGATC
GACTTTGCGA ACGCACTCAA GCGCAAGACC GCATTCCTCA AGGCCGACTA CGACCTGACG
CCTTCGCTGA CCGCCTATGG CCAGTTCATG TACGTCGATC TTTCGGTCCA CACCGCTTCC
GGCGGCAGCC TTACCCAGTT CGGCACGCTC ACCACGGTTC CGGTGACCAA CCCGTTCATC
CCCGATGATC TCAAGACGAT CCTCGCCTCG CGCCCCAATC CCAACGCCCC GTTCACCTGG
AACGGCCGTT ATGTCGGCGT GCCCTACAAG AACTGGGACG AGAACTACGT CGTCGAGCAG
TACATGGCTG GCCTGAAGGG CGATATCGCC AGCGGCTGGT CGTTCGACCT CTTCGCCTCG
TACGACCAGT CGGTCCACGA CCAGCAGCTC AACGATGCGG TGATCAAGGG CCGGGTGCAG
ACGCTGTTGA ATGCGGCAGA CGGCGGCGCG TCGATCTGCG CGGGCGGGTT CAACCCGTTC
GGCGATGCCA ATGCCCGCTC GCTGTCGGAT GCCTGCGTCA ACTACATCAC CAAGACAGCC
TTCTCGAAGG AGAAGCTCGC CCAGACCCAG GCCCAGCTCC AGGTCAACGG CAAGCTCTTC
GACCTCGGCG CGGGTCCGGC GCAGATCGCG CTGGTCGCCG GCTATCGCAA GAACACCTAC
TCCTATGTGC CTGACAGCGA TCTTGCCGCG CAGAACATCG AGGCGGTGAT CGCCTCGCAG
CCTGCCTCGG GCCGTATTTC GGTCAAGGAA TTCGCGGCGC AGGTCGATAT TCCGCTGCTC
GCCGACAAGC CGTTCGTGCG TGAACTGGGC ATCGGCGGCG CGATCCGTCA TTCGGACTAC
TCGGTCACCG GCGGGGTCAC CAGCTACGAA GTCGACGCCC GCTGGCGTCC GGTCGATGCG
CTGCTGTTCC GCGGCAGCTA CCAGCGCGCG GTGCGTGCGC CGAACATTGG CGAACTGTTC
TCGCCGCAGA CCGGCACGCA GCTCGTCATC GGCACGCCTC CGGGCGCGCT GGGCGATCCT
TGCGACGTTC GCTCCAACGC CCGCACCGGC GCCGATGGCG CGAAGGTGGC TGAGCTTTGC
GTCGCGCAGG GCATCCCGCA GGCGGCCATC TCGAGCTACA CCTTCCCGAC CACGGCAACG
GGCCAGCTCG TGTCGGGCAA CACCAACCTG ACGCCTGAGC GCGCCGATAC GTTCAACGTC
GGCTTCGTAC TAAACTCGCC GTTCGAAAGC GGCCCGCTTG CCGACTTCAC GCTCTCGGTC
GATTACTACA ATATAGCCGT GAAGAACGTG ATCTCGACGG TGCCCGGCCT GACCGTGCTG
TCGAAGTGCT TCAACCTCGA CGGTTCGAAC CCGACCTACG ACAAGAACAA CCTCTATTGC
GGCCTGCTGG TGCGCGACAA CAGCGGACAG CTCACGACGG TTGCGACGCC GTACCTCAAT
CTCGGCGCGG TGAAGACCGA TGGTGTGGAA GCGCAGGTCA ACTGGACGGT GCCGGCGCGC
TTCCTCGGCG ACGAGAGCGG GCGGTTCTTC GTCAACTCGG CAGTGGGCTG GCTGCACAAG
TACAAGTTGC AGCTGCTGCC TGGCGCGGCC TATCTCGACT ATACCGGCAT CAGCAACGGC
GCCGCCGGCG TCAGCAGCCT GCCGCCGCGC GCGACGCCCA AGTGGAAGGC CGTGACGAGC
TTCGGCTACC GTTCAGATGC GGCAACGCTC GGCCTGCGCT GGCGCTACCA GAGCAAGATG
GAGGACACGA CCTCGGTCCT GACGCCGGCG ACAGCGCAGG TGGGCGTGCC GGCATATGCG
CTGTGGGACA TGTTCGGCTC TGTCCGGATC GCCAAGACCT TCGAGATCCG CGCGGGCGTG
AACAACCTGT TCGACAAGGG ACTGCCGTTC GTCGCCAGCT CGCAGAACGG CACCGACGTG
GCGCTTTATG ACCCGATCGG CCGCTCGTTC TACCTTGGCG CCAAGGTCAG CTTCTGA
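
The gene sequence is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that clean-up and of a GC-fraction calculation like the one behind the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and newlines used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First display line of the gene sequence, copied from the record above.
first_line = "ATGCGACCTC GTGAAGGGCT ACGACTCAAG TGGCCCGTGG AGGAGGATGA CATGCATATG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full 2997 bp sequence rather than a single line, the same function should give a value comparable to the reported GC content.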
 
Protein sequence
MRPREGLRLK WPVEEDDMHM RGTRLAISAS LLTVSFALAA PAFAQEAAQD KAAEAPVDAR 
EIVVTGSLIQ RPNNTAVSPI VSVGEAALKE TGQVNLQDAL NQFPSFTTAG NANTGGQGTG
GRASINLHGL GTNRNLVLLD GKRLPLSDIN GNVDINILPE SIMGGVDVIT GGASAIYGSD
AISGVVNFKT LRAFDGVRLD AQNSISERGD GYKFNGSLAF GTGFAEDRGH VIAAFSYAQQ
DAVNGSSRDF FFDKTPSSFI GTGTFVPSAT NAPTVDAVSG VFSGYGVGGL PNNVNLLNLG
FNNDGTLFTQ TGARNYKGAN GTNGYLVVGN NVRMPVGQQI DFANALKRKT AFLKADYDLT
PSLTAYGQFM YVDLSVHTAS GGSLTQFGTL TTVPVTNPFI PDDLKTILAS RPNPNAPFTW
NGRYVGVPYK NWDENYVVEQ YMAGLKGDIA SGWSFDLFAS YDQSVHDQQL NDAVIKGRVQ
TLLNAADGGA SICAGGFNPF GDANARSLSD ACVNYITKTA FSKEKLAQTQ AQLQVNGKLF
DLGAGPAQIA LVAGYRKNTY SYVPDSDLAA QNIEAVIASQ PASGRISVKE FAAQVDIPLL
ADKPFVRELG IGGAIRHSDY SVTGGVTSYE VDARWRPVDA LLFRGSYQRA VRAPNIGELF
SPQTGTQLVI GTPPGALGDP CDVRSNARTG ADGAKVAELC VAQGIPQAAI SSYTFPTTAT
GQLVSGNTNL TPERADTFNV GFVLNSPFES GPLADFTLSV DYYNIAVKNV ISTVPGLTVL
SKCFNLDGSN PTYDKNNLYC GLLVRDNSGQ LTTVATPYLN LGAVKTDGVE AQVNWTVPAR
FLGDESGRFF VNSAVGWLHK YKLQLLPGAA YLDYTGISNG AAGVSSLPPR ATPKWKAVTS
FGYRSDAATL GLRWRYQSKM EDTTSVLTPA TAQVGVPAYA LWDMFGSVRI AKTFEIRAGV
NNLFDKGLPF VASSQNGTDV ALYDPIGRSF YLGAKVSF
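
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration, not the database's own pipeline. It builds the standard codon assignments, which translation table 11 shares for internal codons. Table 11 differs mainly in the start codons it allows, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order; amino acids listed in the matching order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = "ATGCGACCTC GTGAAGGGCT ACGACTCAAG TGGCCCGTGG AGGAGGATGA CATGCATATG"
print(translate(first_line))  # MRPREGLRLKWPVEEDDMHM
```

The output matches the first 20 residues of the protein sequence above. The gene's final codons, AGC TTC TGA, translate to S, F, and a stop, consistent with the protein ending in SF.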