Gene Saro_0168 details

Gene Information

Locus tag: Saro_0168
Symbol:
ID: 3918304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 166719
End bp: 169919
Gene Length: 3201 bp
Protein Length: 1066 aa
Translation table: 11
GC content: 62%
IMG OID: 640442894
Product: TonB-dependent receptor
Protein accession: YP_495451
Protein GI: 87198194
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4771] Outer membrane receptor for ferrienterochelin and colicins
TIGRFAM ID:
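
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(166719, 169919)
print(length)                  # 3201
print(protein_length(length))  # 1066
```

Under these assumptions the computed values agree with the Gene Length (3201 bp) and Protein Length (1066 aa) fields of the record.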


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAATCA AAACATTCTC TGCGCTCCGT GCGAGCGCCG CGCCGCTGGC GCTGGTCGTC 
GCGGGCTTTA CCGCATCGAC CGCCTTCGTC GCGCCCGCAA TGGCGCAGGA CTACACCCGC
GGTAACCTCG TGGGCGAGGT GCTCGACGGC AACGGCGCTC CCGTTTCCGG CGCGCAGGTG
ACGATCCGTT CGAACGAACA GGGCTTCACC AACACCACCA CCACCGATTC CAGCGGCCAG
TTCCGCGTCA CCGCTCTCCC GACCGGCACC TACTCGGTGA CCGTCACGGT CGACGGCGCG
GTCGTTGTCC AGGACAACTC GGCCAGCGTC GTTGCCGGCT CGAACAACTC GTACCGTTAT
TCGACGGGCG AAGCTGCTGC CGGCGGCGCG ATCGTCGTCA CCGGTTCGCG CATCAAGACC
AACGACTTCG CTCAGAACAC CTCGGGCCTC ACCCTGAACG TGCAGGAAGT TGCCGAGAGC
GTTCCGATCG CGCGTTCGCA GTCGGCCCTC ATCCTGCTTG CACCCGGCAC CAACGCCGGT
GACACCGGCT TCGGCGACTG CCCCGACTGC GTGAGCTTCG GCGGCGCGTC GATCGCCGAG
AACAGCTACT ACGTGAACGG CCTCAACACG ACGAACTTCC GTACCTTCGT CGGCAACAAC
GTCGTTCCGT TCGAGTTCTA TCGCACGTTC GACGTCAAGA CCGGCGGCTG GTCGGCCGAA
TACGGTCGTG CACTGGGCGG CGTGACCTCG GCGGTCACCA AGTCGGGTTC GAACAACTTC
GAATACGGTG CGGTCGTCGC CTACACGCCC GACTTCCTGA GCGAAGATTC GCCCAACACC
TACCTTGACG ATACCGGCTC GCTCAAGTCT CTCAACAGCC GCGACTACCG CGAGCGCCTC
GAGGCAAACT TCTACCTCTC GGGCCCGATC ATCAAGGATC GCCTGTTCTT CTACGGTCTG
GTCACGCCGC GCTACAGCGT GTCGGAAGAC ACCTCCCCGT CGTCGGGCTA TCGCGTCCGC
GCCAAGTCGA ACACCCCGTT CTACGGCGGC AAGCTCGACT TCATTCCGTT CGACGGTCAC
CGCATCGAAG GCACCTTCTG GTCCGACGAA CGCACGATCA ACTACGACTA CTACAACGTC
GACGCGCTCG GTAACGAGAA GACCGGCATC ATCACTGGGA TCAATCGCGA AGGTCGTGAA
ATCAACAAGA TCGGCGGCAA GAACTGGATC GTCCAGTACA CTGGCCAGTT CACCGACTTC
TTCACGCTTT CGGGTGCCTA CGGCGAGAAC CGCTACAAGC GCTATGACGT GATCAGTGGT
GGCGACAGCG CCGTTCCGAC CATCCAGACG CAGCTTGCCT ACGACGGCAA CGGCGAACCT
GTCACCTCGC TCAAGACCAT CGCCGGCGTG CCCGTTTCGC CGACGGACGG CCAGGACCTG
CGCAAGGTGA TGCGCATCGA CGCGGACCTC TATGTGAACC TGCTCGGTTC GCATCACTTC
CGCTTCGGCT TCGATCGTGA AGACCTGTCG GTTACCGAAG ACACCTTCTA CACCGGTGAC
CGCACCTATC GCTTCACGGC GAACTACATC CGCACCCGTA CCTACCTGAA CGAAGGCTCG
TTCAAGACCA AGCAGACGGC CTTCTACATC CAGGATAGCT GGGACCTCCT GAACGACCGC
CTCAACCTGC AGCTGGGCGT GCGCAACGAC CAGTTCCAGA ACTACGGCAT CACGGGCGGC
AAGTACCTCG ACCTCAAGAA CCAGTGGGCT CCGCGTCTTG GCGCGTCGTT CGACGTGTTC
GGCGACAAGC TGACCAAGAT CCAGGCGTTC TGGGGCCGCT ACTATCTGCC GGTCGCCACC
AACACCAACA TCCGCCTGGC CGGCGCCGAA ACCTACTACG AGCAGCGCTT CGGCTACGCT
CCGGGTGTCG TCGGTTCGAA CTATGACACG AACGGCGTTC CGATCGGCCA GCAGTTCGAC
AGCTCGGGCG CTCCGATCCT CGGCTCGCTC ACCGGCGCAA ACTCGCTGAA CTGCCCCGAC
TTCGGTCCGG GTGCCGGCCA GAAGTGCCGC ACCGTGTTCT CTGACGGCCT TCCCGGCCCG
ACGGACACCC TGGTCTCCTC GACGCTCAAG CCGATGTACC AGGACGAACT GATCTTCGGC
ATCACGCACC GTATGGAAGA CTGGACCTTC GGTCTTCGTT ACATCAACCG TCGTCTCAAG
CAGACGCTGG AAGACATCGC GATTGACGAA GCGGTCAACC GTTACTGCGA ACAGCAGAAC
CTCGATTGCG CAACCTCGTC GGGCAGCCCG ATCTGGTCGG GCTTCCACCA GTACGTTCTG
GCCAATCCTG GTGAAGCCGT CACCGTGCGC CTCGATGGCG ATCCGACGAA GCCGGGCACG
ACTGACGTCG TCACCCTGTC ACCGGAGCTG CTTGGCTATC CGAAGGCCGT CCGCAAGTAC
GACTCGATCG AGTTCACCGC GTCCAAGGCC TTCAACGGCA CCTGGGGCTT CGACTTCAGC
TACACCTGGC AGAAGCTTCG CGGTAACTAC GAAGGTTCGG TCAAGTCGGA CAACAACCAG
GACGACGCCG GCCTTACGCA GGACTTCGAC GTTCCGGGGC TGACCACTGG ATCGTACGGT
ACGCTTGCCA ACAATCGCGA GCATACCTTC AAGCTGTTCG GTTCGTGGCA GCCGGTTGAC
TGGCTCCGCA TCGGTGCAAA CCTGACCGTC CAGTCGCCGC GCAGCTTCAG CTGCATCGGC
GTCGCCATCC CGGACTACAT CAAGCTGCTC CAGGCTGGCG AAAGTGCGGT TCTGAACGGC
GGTGCGGCTT CGCAGTACGG CGCCGCGTCG TTCTACTGCC GCAACCCGAA GGGCAACCAG
AACGGTACGA CGGTCACGAA CGACATCACC GGCGAAACCA GCGTGCTGGT CAACCGTGGT
ACGGCGTTCA AGAGCGACTG GTCGAAGAAC CTCGACCTCG GCTTCCAGTT CAAGCTGGGC
GAGGCTCTGG GCAATTCGAA CTTCCGCATC GACGTGTTCA ACGTCTTCAA CTGGAAGTCG
AAGACCGACT TCGTCGAATT CGGCGAAACG GACTCGGGTG CCACCCGCGC GGACTATCGT
CTGCCGACCG GCTACCAGGC TCCGCGCCAG GTGCGCTTCA CCTGGACGAT GCGCTTCGGT
GCAAACAACG GCGCCGACTG A
 
Protein sequence
MKIKTFSALR ASAAPLALVV AGFTASTAFV APAMAQDYTR GNLVGEVLDG NGAPVSGAQV 
TIRSNEQGFT NTTTTDSSGQ FRVTALPTGT YSVTVTVDGA VVVQDNSASV VAGSNNSYRY
STGEAAAGGA IVVTGSRIKT NDFAQNTSGL TLNVQEVAES VPIARSQSAL ILLAPGTNAG
DTGFGDCPDC VSFGGASIAE NSYYVNGLNT TNFRTFVGNN VVPFEFYRTF DVKTGGWSAE
YGRALGGVTS AVTKSGSNNF EYGAVVAYTP DFLSEDSPNT YLDDTGSLKS LNSRDYRERL
EANFYLSGPI IKDRLFFYGL VTPRYSVSED TSPSSGYRVR AKSNTPFYGG KLDFIPFDGH
RIEGTFWSDE RTINYDYYNV DALGNEKTGI ITGINREGRE INKIGGKNWI VQYTGQFTDF
FTLSGAYGEN RYKRYDVISG GDSAVPTIQT QLAYDGNGEP VTSLKTIAGV PVSPTDGQDL
RKVMRIDADL YVNLLGSHHF RFGFDREDLS VTEDTFYTGD RTYRFTANYI RTRTYLNEGS
FKTKQTAFYI QDSWDLLNDR LNLQLGVRND QFQNYGITGG KYLDLKNQWA PRLGASFDVF
GDKLTKIQAF WGRYYLPVAT NTNIRLAGAE TYYEQRFGYA PGVVGSNYDT NGVPIGQQFD
SSGAPILGSL TGANSLNCPD FGPGAGQKCR TVFSDGLPGP TDTLVSSTLK PMYQDELIFG
ITHRMEDWTF GLRYINRRLK QTLEDIAIDE AVNRYCEQQN LDCATSSGSP IWSGFHQYVL
ANPGEAVTVR LDGDPTKPGT TDVVTLSPEL LGYPKAVRKY DSIEFTASKA FNGTWGFDFS
YTWQKLRGNY EGSVKSDNNQ DDAGLTQDFD VPGLTTGSYG TLANNREHTF KLFGSWQPVD
WLRIGANLTV QSPRSFSCIG VAIPDYIKLL QAGESAVLNG GAASQYGAAS FYCRNPKGNQ
NGTTVTNDIT GETSVLVNRG TAFKSDWSKN LDLGFQFKLG EALGNSNFRI DVFNVFNWKS
KTDFVEFGET DSGATRADYR LPTGYQAPRQ VRFTWTMRFG ANNGAD
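
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helpers `clean_sequence` and `gc_fraction` are hypothetical names introduced here for illustration, and the example input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in a nucleotide sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "GTGAAAATCA AAACATTCTC TGCGCTCCGT GCGAGCGCCG CGCCGCTGGC GCTGGTCGTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full gene sequence, the result can be compared against the GC content field of the record. Note also that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted alternative start codon and is translated as methionine at the initiation position.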