Gene Saro_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3744 
Symbol 
ID5077892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp382668 
End bp384929 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content65% 
IMG OID640481467 
ProductTonB-dependent receptor 
Protein accessionYP_001166129 
Protein GI146275969 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAACA GTATGGCGCG TGGCGCTTCG TGGCTCGCAG TGGCAGTGGC TGCCGGCGGT 
CTGGCCCAGG TACCAGCGCG CGCGCAGGAC GCGGCTGCGG TCGACGGCGC GGAAGAAATC
GTCGTCACCG CGCGCAAGAC CACAGAACGT CTGCAGGACG TGCCGATCGC CGTTACCGCG
CTGTCGGCCA GCGCCCTCGA TGCGCGTGGC ATCGGTTCGG TGACCGATCT CCAGCAGGTC
GCCCCCAATC TCCAGTTCAC GCCCGGTACC GGCGGTAACG GCGGCGCCAT CGCGCCGTTC
ATCCGCGGCG TGGGCGAAAA CGATTTCATC ATCACCGCGG ACCCCGCGGT CGGCACGTAT
TTCGACGGCG TCTATGTCGC CCGCACCTTC GGTGCCTCGC CCGAATTGCT CGACGTAGAG
CGGGTCGAAG TGCTGCGCGG ACCGCAGGGC TCGCTGTTCG GCAAGAACAC CGTGGGCGGC
GCGATCAACG TCGTTACCCG CATGCCCGGT GACACGGCCG AGTTCGAGGG CGATGTCCGT
TACGGCTCCT ACAACGACTT CCGCGTCCGT GCCCGCGCCG CGCTTCCGCT TGGTGGCGGC
TTCTCGCTCG GCCTTTCGGG GCTTGGCGAA TGGGGCGACG GTTGGCAGCG CGTGCCCTCG
GGCAAGGATC TTGGCAACCG CAACGTGGTG AATGGCCGCG CGGTCCTGCG CTACCAGGGC
GGCGCCTTCG AGGCGATCGC ACAGGTTGAC GGCCTGCGTC GCCGCCAGAA CTCGGCCGCG
CACAGCATGC TGGACTTCAC GCCGACGTTC TTCTCGACGC TGCAATCGCT GTTCATCGCG
CCCTGTTGCA CTGTGCCGGA TCGCATCGAC CGGACCGACA CGACGCCCTC GCTCAATCGC
GACCACGCGG ATGCGGCAAA CGCCTCGCTT ACGCTGAGCT ACGACCTTGG CGGCGCGAAG
CTCAAGTCGA TCACCGCCTA CCGCTGGGTC CATGCCCAGT TCGGCCGCGA CGGTGATGCC
TCGGCTGAGG TCAACTACGC CGGCGATTTC CACAATGAAC GCGCCCGCCA GTTCAGCCAG
GAACTGCAGT TCACCACCTC GATCTTCGAT CGCGGCTCGC TCCTTCTTGG CGCGTACTAT
TACCGCGAGC GGACGAAGGA TCTCACCCGC CTCGTCGTGG CCGACGGTCT CTACGACGCA
TCCGGCTTTG CTGAATTCTC GACCGACGTG CTCGAACTGC CGCCCGAATT CCTCGATTTC
AACATCGACT TCGACAACCG CCAGACGACC ACCAACTTTG CCCTGTTCGG CAACGCCACC
GTGCCTCTTG CCGAAAGGCT GACGCTGGAA CTGGGCGGTC GGTACACGCA CGAGAAGAAG
GCGTTCTACC AGGCCGCGAA CCGGGTCTAC AGCAACACCC CGCTGTTGTT CGGCACGCCT
TCCTACGAGC TTGAACAGAG CTGGGATGCC TTCACCCCGC GGGTCTCGCT CAGCTACAAG
CTGCGCGACG ACGTTCTTGC CTATGCTTCC TGGTCGCAGG GCTTCCGCAG CGGCGGGTTC
AACGGTCGCC CGACCTCGCT AGAGGAAATC GGCTCCTACG ATCCCGAGCA TCTCGATGCC
TTCGAGGTCG GCGTGAAGAG CCAGTTCGGC CGCATCCTCA CGCTCAACCT CGCCGTCTTC
CGCAACCAGT ATCGCGACCA GCAGCTCCTC ATCAGCACGG TCAGCGAGAA CACGGGCCTG
ATCGTCGTTC GTACCGAGAA CGCCGGCAAG TCGCGCATCC AGGGAATCGA GCTGGAGGGC
ACGGTCCGCG TCTCGCCGCG CTTCCGCATC GACGGTTCGC TCGGCCTGCT CGATGCGAAG
TACCAGAAGT ACGTCTCGGT CATTGCCGGC GTTCCGACCG ACGTCAGCGG GCGCAAGCTG
AAGCAGGCGC CCGAAGTGAC CGGCAGCCTC GGGATGTCCT ATACGCTTCC GCTTGGCGAG
CGGATGGACG CGACCTTCCG TGCCGATGCG ACGTATCGCA GCGCGAACTT CATCGATGTG
GAGAACACGC CCGAACTGCG CGCGCCGGAT CATGCGATCC TCAACCTCAG CACCACCCTG
CGCCTGCCGG TGGACGGTCT CTCGCTGCGG CTGGCGGTGG ACAACGTGAC CAATCGCCGG
ATCATCGTTG CGGGTTATGA TGCCCGGACT TCGTTCGGCT TCCTCGAAGG CTACTTCAAC
GAACCGCGCC GCTACTGGGC GACGCTCTCG TTCAGGCGCT GA
 
Protein sequence
MRNSMARGAS WLAVAVAAGG LAQVPARAQD AAAVDGAEEI VVTARKTTER LQDVPIAVTA 
LSASALDARG IGSVTDLQQV APNLQFTPGT GGNGGAIAPF IRGVGENDFI ITADPAVGTY
FDGVYVARTF GASPELLDVE RVEVLRGPQG SLFGKNTVGG AINVVTRMPG DTAEFEGDVR
YGSYNDFRVR ARAALPLGGG FSLGLSGLGE WGDGWQRVPS GKDLGNRNVV NGRAVLRYQG
GAFEAIAQVD GLRRRQNSAA HSMLDFTPTF FSTLQSLFIA PCCTVPDRID RTDTTPSLNR
DHADAANASL TLSYDLGGAK LKSITAYRWV HAQFGRDGDA SAEVNYAGDF HNERARQFSQ
ELQFTTSIFD RGSLLLGAYY YRERTKDLTR LVVADGLYDA SGFAEFSTDV LELPPEFLDF
NIDFDNRQTT TNFALFGNAT VPLAERLTLE LGGRYTHEKK AFYQAANRVY SNTPLLFGTP
SYELEQSWDA FTPRVSLSYK LRDDVLAYAS WSQGFRSGGF NGRPTSLEEI GSYDPEHLDA
FEVGVKSQFG RILTLNLAVF RNQYRDQQLL ISTVSENTGL IVVRTENAGK SRIQGIELEG
TVRVSPRFRI DGSLGLLDAK YQKYVSVIAG VPTDVSGRKL KQAPEVTGSL GMSYTLPLGE
RMDATFRADA TYRSANFIDV ENTPELRAPD HAILNLSTTL RLPVDGLSLR LAVDNVTNRR
IIVAGYDART SFGFLEGYFN EPRRYWATLS FRR