Gene Saro_3403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3403 
Symbol 
ID5077552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp2678 
End bp4873 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content65% 
IMG OID640481127 
ProductTonB-dependent receptor 
Protein accessionYP_001165789 
Protein GI146275629 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.181941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGATC TTACACTGCG CAGTCGCGCT GTGCTCGCCG TACTCATGTC CACCGTTGCC 
ACGCCAGCGC TTGCGCAAGC GGTCGACGAG CCCGCGAACG ACGGCGGCCT CGAAGCCATC
GTCGTCACCG CAGAGCGCCG CGAGCAGAGC CTCCAGGCCG TGCCGATCTC GGCCACCGTG
CTTTCGGGCG AAGAGCTTCA GCGCAAGGGC GTTTCCAACC TCAACGACAT CCAGCAGGTC
GCACCCTCGG TTGCCATCAA CACCTTCAAC CGCTCGACCT TCATCAACAT CCGCGGCGTC
GGCATCGCCC AGTCCGCGCC CACCTCGAAC CCCGGCGTCG CCTACTACAT CGACGGCCAG
CTCATCCCGC ACGAGCAGTT CATCGGCCAT TCGTTCTTCG ACATCGGCAC GATCGAGGTC
CTGCGCGGCC CCCAGGGCAC GCTGACCGGC CAGAACTCCA CCGGCGGCGC GATCTACGTC
CGCACGCCCG AGCCCGAGTT CGACAGCACC TTCGCCATCG GTGACGTCAC CGTCGCCAAC
TACGACCGCT ACCGCGCCGT CGCCGCGCTG AACCTCGGCG GCGAGGACGT CGCCCTGCGC
ATCGCCGGCG TGCACGAGGA GCGCGACAGC TTCACGCGCA ACATCGCGGC CAACGCGCAG
AGCCAGCCCG GCAACCTCAA CATGGACGCG ATCCGCGCCA ACCTGCGCCT GCGCGACATG
GACGGCCGCC TGACCGTCAA CGTGCGCGGC GAATACTTCG ACGTCCGCTC CGACAACAAC
GCGGTCAAGA ACCGCAACGA CAAGGTCAGC AGCAATCCGT TCGAGATCGA GGAAGACGCG
CTCTCGTTCC AGAACCAGGC CGGCTACCGC ATCTCGACCG AGGCACGCTA CGACGTGTCC
GACAGCGTCC AGGCGCGGGG CCTCGTGTCC TGGCAGGACG GCTATACCCA CGACCAGACC
GACGGCGACC GCACCGCGAC CGCCCAGGCC GTCCCCGCCA ACCTGTCCAC CAGCAGTGCC
AACACCCGGA CCTATCCCGG CCGCGTCAGC AATGGCGACA CGCGCTTCAA GACCCTGATC
GGCGAGTTCA ACCTGCTCTC CACCGACAAG GGGCCGCTGC AGTGGGTCGT CGGCGGCTTC
GTGATGGACG AAACCGTCCC CGTCACCTTG CTGCGCGACA ACCGCAACAC GCTCGACCTG
CTCCAGTCGA ACAGCTCGAT CATCACCGAG GCGAAGAACA CCTCGCAGTC GGTCTTCGGC
CAGGTGAACT ACTACGTCAC ACCCGCGATC GAAGTTCTGG CGGGTGCGCG CTACAGCTTC
GACAAGCAGG TCTATACCCG CTTAGCCGTC CCGGGCGCAG GCTTCACCCT GCCTTTCACC
AGCGAAGCGA AGTCGGAACA GCTCACTGGC AAGATCGGCC TCAACTACCA CTTCGGCGCC
GACAACCTGC TCTACGTGAC CGCATCGAAG GGCTACAAGG CGGGCGGCGT GAACCTCACG
CCCAACACTC CCGACTTCAA GCCGGAACGC AACTTCGTCT ACGAAGCAGG CTTCAAGACC
GAACTCCTCG ACCGCCACCT GCGCGTGAAC GGCGACGTGT TCTACTCGGA TTACAAGGAC
ATCCAGCTTT CCAGCCTCGT CGGCGGCCTG CCCACCACGC AGAACGCGCT GGCTGGCCGT
GCCTATGGCG GCGAACTTGA AGTCACTGCC CAGTTCGGCG GCTTCGCGGC GAACGCCGGC
CTCGGCTACC TCGATGCCAA GTTCAAGAAC TCGGCCTGCA TTTCCGACAC CAACGCCGCC
GGCACCGATC CTGGCTGCGC CACCAACCTG CGCTTCGTGC CCAAGGGCCG CGTCCTGCCG
TTCTCGCCGG AATGGACCGT CAACGCGGGC GTCCAGTACA CGCTCTCGCT CGGCAGCGTG
GACGTGACTC CGCGCGTGCA GTGGTCGTAC CTGTCGGAAC AGTACGCCAC CCCGTTCCCC
AGCGTGAACA CGCTGGTCCC GGGCCGCAAC CTGTTCGACG CGCGCCTCAC TTTCGACCTC
GGTCGCAAGT ACAAGCTCGA AGGCTTCGTC AACAACCTGA CCAACAAGAC CTACATCGCC
ACGCAGATCC AGAACAGCTC GAGCGCGGAC GGCGGCATCA TCTACGGTGC ACCCCGCACC
TGGGGCGTTC GCCTGAAAGT CGAGATCGGC AACTGA
 
Protein sequence
MPDLTLRSRA VLAVLMSTVA TPALAQAVDE PANDGGLEAI VVTAERREQS LQAVPISATV 
LSGEELQRKG VSNLNDIQQV APSVAINTFN RSTFINIRGV GIAQSAPTSN PGVAYYIDGQ
LIPHEQFIGH SFFDIGTIEV LRGPQGTLTG QNSTGGAIYV RTPEPEFDST FAIGDVTVAN
YDRYRAVAAL NLGGEDVALR IAGVHEERDS FTRNIAANAQ SQPGNLNMDA IRANLRLRDM
DGRLTVNVRG EYFDVRSDNN AVKNRNDKVS SNPFEIEEDA LSFQNQAGYR ISTEARYDVS
DSVQARGLVS WQDGYTHDQT DGDRTATAQA VPANLSTSSA NTRTYPGRVS NGDTRFKTLI
GEFNLLSTDK GPLQWVVGGF VMDETVPVTL LRDNRNTLDL LQSNSSIITE AKNTSQSVFG
QVNYYVTPAI EVLAGARYSF DKQVYTRLAV PGAGFTLPFT SEAKSEQLTG KIGLNYHFGA
DNLLYVTASK GYKAGGVNLT PNTPDFKPER NFVYEAGFKT ELLDRHLRVN GDVFYSDYKD
IQLSSLVGGL PTTQNALAGR AYGGELEVTA QFGGFAANAG LGYLDAKFKN SACISDTNAA
GTDPGCATNL RFVPKGRVLP FSPEWTVNAG VQYTLSLGSV DVTPRVQWSY LSEQYATPFP
SVNTLVPGRN LFDARLTFDL GRKYKLEGFV NNLTNKTYIA TQIQNSSSAD GGIIYGAPRT
WGVRLKVEIG N