Gene Saro_2458 details

Gene Information

Locus tag: Saro_2458
Symbol:
ID: 3916777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2649885
End bp: 2652161
Gene Length: 2277 bp
Protein Length: 758 aa
Translation table: 11
GC content: 61%
IMG OID: 640445213
Product: TonB-dependent receptor
Protein accession: YP_497728
Protein GI: 87200471
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR03304] outer membrane insertion C-terminal signal
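
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch like the one below. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2649885, 2652161)
print(length)                  # 2277
print(protein_length(length))  # 758
```

Both results match the Gene Length (2277 bp) and Protein Length (758 aa) fields.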


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGAC GTCTTCGGGC AATTTGCCTT GCGCCTCTGG GGGCCGCACT GGTCCTCCCG 
GCAGCCGCCC ACGCACAGTC GGCGCAGGCC GGTGATGCAG CCGACTCCTC GGCTCCTTCC
ACAGCCGAAA TCATTGTCAC CGCAACGAAG CAGAATACCG CGCTCAGCCG CGTTGCTGCT
GCCGTTACTG CCGTGACTGC GGCGGATATC GGTCCCGGCG GCGTCCAGAA CGTGGGAGAC
CTGCAGGTTG TCGTGCCGAA CGTTTCGATC GGCGATCAGT TCGGCGTCAA CCGTACCTTC
ATTCGCGGCG TCGGCCTGAA CAACTTCGAT GTCGGCGGGG AAGGCGCGGT CGCGTTCCTC
CAGAACGGCG CAATCCTGGC GCGTCCGGCC CAGCAGCTCA GTGGCTTCTT CGACCTTTCC
CAGATCGAAG TCCTGCGCGG ACCGCAGTCG ATCCTTTACG GTCGCGGCGC CACTGCTGGC
GCGATCAATC TCGTTACCGC AGCTCCGACC GACGAGATGG ACGGGTATCT GCGAGCCACC
TATGGCAACT ATGACAACCG GGTGCTTGAA GGCGCGATCG GCGGTCCGCT CGCGGGCGAC
CGCTTGATGG TGCGCCTCGC CGGCAAGTAT GAAAAGCGCG ACGGTTACGG CGTCAACCTT
TTCACCGGTA ATCCGGTCGA TGACCGCGAT GCCTATTCCT TGCGCGCTAC CGTGGTCGCC
AAGCTCAGCG ATACGTTCAA GGCGACGATC GTTGGCGATC ATTTCAAGGA AGACGACAAC
AATTATGCCT TCCACTATTT CGGTCCGTCG GTAATTCCCG AAGCAGGGCT GTTCCACAAG
TTCATCGGTG GCAAGTCGAT CTTCGACTAT TACGCCGCGC GCAACGAAAA GCCCAACCTG
CGCAATATCT ATTCCGATGA AGAGGCGATC AACAACCGCA AGGGCTATGG CGTGACCGGC
ACGCTCGACT GGGACCTGGG CGCTGCATCG GTCAAGTCGA TCACCGCCTA CCGCACATTC
GACCGTTTCC AGCGCGACGA CCTCGACGTG TCCGACGTCA ATCTTGCCGG CCAGAACAAC
TACGTCGAAA AGAGCCGCGC CTTCAGCCAG GAAGTCACAG TCAACTACGA AGGCGATGGT
TTCACACTGC TCGGCGGTGC GATGTACCTG CACGAGAAGT TGACTGGGCA GGTGCTCGTG
CCGACCGTGA ACCTCGGCGT CCTGTTCGGC CTGCCGGCCG ATACGTTCGA CGATGGCGCC
TACGAGCAGA ACGGACTGGT CAAGATCGAT GCCGTCGGCG TCTATCTCCA GGGTGCGGTC
GATCTCAGCC CCACGATCAA GCTCACCGCC GGCGCGCGCT ACAACTATGA GCATCGCAAG
GGTGATGGCT ACTTCCGCTT CGATGCGCTC GGCGTGAACA TTCCGACCGA CAAGTCCAAG
GGTTGGTCCT CGGTTACGCC GAAGGTGTTG CTGGAGTTCA AGCCGACTGA CCGCACGCTC
CTCTACGCAA GCGTCACAAA GGGCTTCAAG TCGGGCGTCA TCAACATCGG CAGCGTCGAT
GCGGCGATCG ATCCAGAAAC GGTCTGGAGC TACGAAGCGG GCTTCAAACA GAAGCTGGCG
GACAATCGCG TCCTGCTCAG CGGTGCGGTG TTCTACTACG ATTACAAGAA CCTGCAGGTC
AGCTTCGTCA ACGCCAACTC GATCGTGCAG ACGATCAATG CGGCAAACGC GCGCAACTAC
GGCGGCGAGC TGGAGCTTGA AGGCAAGATC ACGCCGCAGT TCTCCGTGAA CCTCAACGCG
TCCTACCTCA ATGCTCAGTT CACGAAGTTC TGCAACGCCT ACTACGGCGC GGCCTTCCCG
GCGCGGTCCG GCATTTCCTA TCCGGTTTGC CCCACCGATT CCGGCCTCGT CGACCTCAGC
GGAAAGCGCC TTCCGAACGC GCCGCGCTTC ACCGTTGGCG GTGGCTTCAA CTGGGACATC
CCGCTGGGCA ACGACAGCCG CCTTGCCGTC AACGGCGAGA TCAAGTGGCA GTCGAAGGTC
TACTTCACCG AGTTCAACAA TCGCGATGCG GAACAAGGTT CCTATGCCAT GGCGAATGCC
GGTCTGACCT GGCACGCACC GGGCGACCGC TTCTCGATCG GCGGGTGGGT GAAAAACATC
ACCAACGAGT TCGTGATTGC GAACAACATC ATCACTGCCG CCACGTTCGC CTATGTCCGC
GTCGGCTCGG TCATGCCGCC GCGCACCTAC GGGGTGACTG CCAGCGTGAA TTTCTGA
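
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the full gene length; the function name is hypothetical, and the rounding used by the source database is not known. Whitespace is stripped first because the sequence is printed in blocks of ten bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Spaces and line breaks are ignored, so a sequence can be pasted
    directly in the 10-base block layout used above.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAAGCGAC GTCTTCGGGC"))  # 60.0
```

Passing the whole gene sequence to `gc_content` and rounding the result gives a value to compare with the GC content field.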
 
Protein sequence
MKRRLRAICL APLGAALVLP AAAHAQSAQA GDAADSSAPS TAEIIVTATK QNTALSRVAA 
AVTAVTAADI GPGGVQNVGD LQVVVPNVSI GDQFGVNRTF IRGVGLNNFD VGGEGAVAFL
QNGAILARPA QQLSGFFDLS QIEVLRGPQS ILYGRGATAG AINLVTAAPT DEMDGYLRAT
YGNYDNRVLE GAIGGPLAGD RLMVRLAGKY EKRDGYGVNL FTGNPVDDRD AYSLRATVVA
KLSDTFKATI VGDHFKEDDN NYAFHYFGPS VIPEAGLFHK FIGGKSIFDY YAARNEKPNL
RNIYSDEEAI NNRKGYGVTG TLDWDLGAAS VKSITAYRTF DRFQRDDLDV SDVNLAGQNN
YVEKSRAFSQ EVTVNYEGDG FTLLGGAMYL HEKLTGQVLV PTVNLGVLFG LPADTFDDGA
YEQNGLVKID AVGVYLQGAV DLSPTIKLTA GARYNYEHRK GDGYFRFDAL GVNIPTDKSK
GWSSVTPKVL LEFKPTDRTL LYASVTKGFK SGVINIGSVD AAIDPETVWS YEAGFKQKLA
DNRVLLSGAV FYYDYKNLQV SFVNANSIVQ TINAANARNY GGELELEGKI TPQFSVNLNA
SYLNAQFTKF CNAYYGAAFP ARSGISYPVC PTDSGLVDLS GKRLPNAPRF TVGGGFNWDI
PLGNDSRLAV NGEIKWQSKV YFTEFNNRDA EQGSYAMANA GLTWHAPGDR FSIGGWVKNI
TNEFVIANNI ITAATFAYVR VGSVMPPRTY GVTASVNF
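
The protein sequence follows from the gene sequence by codon translation. The sketch below is one way to reproduce that step, not the database's own method. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored; the stop codon is not included in the result.
    """
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAGCGAC GTCTTCGGGC AATTTGCCTT"))  # MKRRLRAICL
```

The output matches the first ten residues of the protein sequence above. Codons containing characters other than A, C, G, or T raise a `KeyError` in this sketch.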