Gene Slin_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4238 
Symbol 
ID8727997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5112502 
End bp5115807 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003389021 
Protein GI284039091 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.324015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAAA TTTTACTGAT GAGCTTACTT CTGGTATGCT CATTCTGGCT TCCAGCCTGG 
GCTCAGGAAC GAACAATTAC AGGTAAGGTT ACGGCCGCCG AAGATGGTAC GCCTTTACCG
GGTGTATCTG TTGTGTTGAA GGGAGTGGCC CGGGGAACGA ATAGCGATGC TAACGGTGCC
TACTCACTCA ATGTCCCGAC AAAAGGGGGA ACGCTGGTAT TCAGCTTTGT TGGAGCGGCT
TCGCAGGAAA TCGAAATCGG CAACCGTTCC GTTATTGACG TTAAACTGGC GAACGACGCC
AAGCAATTGG GTGAAGTGGT TGTAACGGCT CTGGGCCAGC AACGGGATAA GAAAGCACTG
GCCTATGCCG TCTCCAACGT AAAAGGAGAT GTGCTCCAGC AACGGTCGGA GCCGGACCCG
CTGCGGGCCT TATCGGGTAA AGTACCGGGG GTAAATATCA CGGCCGGTAA CGGAGCACCC
GGTGCGGGTA CGCGGATTAC CATCCGGGGT AACAACTCCT TCACAGGTAA CAACCAGCCG
CTGTTCGTTG TCGATGGTAT TCCTTTCGAT AACTCTGTAA ATACTCCACA AAATGGTAGC
CAGGGCTACA ACACAAACAC TGTAACAACA AACCGGGCAT ACGACATTGA CCCGAACAAC
ATTGAAGCGA TGACCGTACT GAAGGGTGCG GCTGCATCGG CGTTATATGG CTCACGGGCT
GCCAACGGCG TTATTGTTAT CACAACCAAA TCGGGTAGCA AGTCGGCGCG GAAAGGTCTT
GAGATTAATT TCAACACCTC GTACTCAGTC GAAAACGTAT CAACGGTTCC TGATTACCAG
AACACCTATA CGCAGGGCTC CAACCAGACC TACAACGGTG GGTTTATCGG AAACTGGGGA
ACTGTTTTCC CATCGGAGGT TGACCGCATC AACGCCGGCC TGGGTTTTGA GCGGTATTCA
AAAGTAGTTG ATCCGGATTA TCCGGCGGGT ACCATTCCTC ATCCATTGGT CGATGCAACG
GTACCCTATG GTGCTGCCCG CTACCAATCG GCTTTCCCTG AATTGCTCCA GTCAAACGGC
CGGGGTATTG CGGTGCCACT CAAGCCTTAC GATATTATTG GCGGCTTTTT CCGGACGGGT
AAAGTGATGG AAAATGGTAT TCAGATCACC TCTACCGGTG ATAAAACATC GCTGAACGCG
TCGGTCTCAC GAACCAAAAA TGAGGGTATT ATTCCAAACT CATTCACGGA CCGTACCACC
CTGAGCTTTG GTGGTAATGC TACCCTCACC AACAAAGTAA ACGTAGCCGG TAGCGTAGCA
TATACGAATA CCAATCAGCA AAGTCCACAG TCGGGTGCTG GTTACTACGC TGACTACGGC
GGGCTGGCTT CTGCCGGTTC TATCTACAGC CGTTTGTTCT ATCTTCCCCG TAACTTCGAC
CTGAACGGCT ATCCGTTTGA AAACCCCGTA GATGGTTCAA ACGTATTCTA CCGGGCGCTC
GATAACCCGC TCTGGACCGC TAAGTATAAC CTCTATAACT CCAGCGTGAA CCGGGTTTAT
GGAAATATGA CGTTGAGCTA TGACGTTACG CCCTGGCTGA ACTTCACCGC CCGTGGTGGA
ATAAATACGT ATTCAGAAAC CCGCAAAAAT GTCCTCCGTC CCGGTGGTTC GTTTTCTCCG
CTCGGTTCGG TGTCCCGGAC AGATTTGACG AATACAGAAA TCGATTTTAC CTTTCTTGCT
ACAGCGCAGC ACGATTTTTC GGAAAAGATC AATGCCAAGT TACTGGTTGG TTTCAACCCG
AACCAGCGGA CCTATACGGA ATCTTCAGTT AGTGGAGCGC CCGTTATTGA TCCAAACATC
CTGACAATTG GCGGTACACT GAACCAGAAC GCGGCCGATT ACCGGAGCCA GCGCCGTTTA
TACGGAATCT TCAGTGAGTT AACGCTGGGT TACGGCAACT TCCTGTTCCT GACGGCTTCG
GTTCGTAATG ACCAGTCATC GACACTGCCA GCTAAAAATA ACAGCTACTA CTACCCGGCC
GTATCCGGTT CGTTTGTCTT CTCCGACGTG CTGAACCTGC CCAAGAACAT TATCAATCTG
GGTAAACTGC GGGCCAACTA CGCCAAAGTG GGTAAGGATG CTTCGCCCTA TCAGGTATTT
ACAGCCTATA ACTTGGGTCG TACGTTCTAC AACGGTACCG CCATTTCAAC GGCCAACCTG
CCAAGCCAGT TGAACAACGT CAATTTGAAG CCTGAGTTTA CATCGGAGGT GGAGTTAGGT
ACTGAACTCC AGTTCTTTAA TAGCCGTATT GGTATTGACG CAGCTTACTT TGATCGGGTA
TCGACTGACC TGATTGTTAC CCGGGAGCTA CCCCGTACAT CAGGCTTTGC TACTGAAATA
ACGAACGCCG GTAAAATCTC GAACAAAGGT TGGGAGATTG GCCTGACGCT CGTTCCGCTA
CGGATGGCTA ATGGCCTGAC CTGGACATCG TACTTTGCAT ACACAAGTAT TAAGTCGAAA
GTGGAGGATG CTGGTCCCGG TGGTGAAATT TTCATTGGTG GCACGGGCCT TTCTTCGCTG
GGAACCATTT TCCGGAACGG CTTGCCGTAT GGGCAGATCA TCGGTTCGAA GAATGCTCGG
GATGATGCGG GTAACCTGTT GATTAACCCA AGCACAGGTC TTCCGATCCG GGCTGCTAAG
TCAGACATTA TTGGCGACCC TAATACGAAA TACCAGGTAG GCTGGACCAA TACGGTAAAC
TTCAAAAATT TCTCGTTGAG TGTTTTGATG GACTACAAAG CCGGTGGTAG CCTGTTCTCC
AGCACGGCTG CTTCGCTGCT GCTGCGTGGC CAGCTCAAGA ACTCCGAAGA TCGCGAAGGT
ATGCGGGTAA TTCCGGGTGT ACTGGGTGAT CCGGCTACCT ACAAGCCACT TGTAGGTGAC
AATGGTCAGC CGATAAAAAA CACCATCGCC ATGTCGGCCT TCCAGTATCA CTTTACGGAT
GGATACGGTG CTTACGGTGC TGACGAGGTA AACATTTACG ACGCTACGGT CGTGCGCTTA
CGCGAAGTAT CGCTGGGCTA TAGTGTACCT AAAGCCTTCC TGAAGCGGTA TGCTAAGGTA
TTTGGCAGCA TGCGTCTGTC AGTATCGGGT CGTAACCTGT GGTTCTACGC GCCTAACATG
CTGAAAGGGC TGAACTTCGA CCCTGAAGTA CTGTCAAACT TCGCCGATTC GAACATCCAG
GGCTTTGACC TGGGCGCTTC GCCATCCACG CGCCGTTTCG GTATTAACCT CAATGCTTCA
TTCTAA
 
Protein sequence
MRKILLMSLL LVCSFWLPAW AQERTITGKV TAAEDGTPLP GVSVVLKGVA RGTNSDANGA 
YSLNVPTKGG TLVFSFVGAA SQEIEIGNRS VIDVKLANDA KQLGEVVVTA LGQQRDKKAL
AYAVSNVKGD VLQQRSEPDP LRALSGKVPG VNITAGNGAP GAGTRITIRG NNSFTGNNQP
LFVVDGIPFD NSVNTPQNGS QGYNTNTVTT NRAYDIDPNN IEAMTVLKGA AASALYGSRA
ANGVIVITTK SGSKSARKGL EINFNTSYSV ENVSTVPDYQ NTYTQGSNQT YNGGFIGNWG
TVFPSEVDRI NAGLGFERYS KVVDPDYPAG TIPHPLVDAT VPYGAARYQS AFPELLQSNG
RGIAVPLKPY DIIGGFFRTG KVMENGIQIT STGDKTSLNA SVSRTKNEGI IPNSFTDRTT
LSFGGNATLT NKVNVAGSVA YTNTNQQSPQ SGAGYYADYG GLASAGSIYS RLFYLPRNFD
LNGYPFENPV DGSNVFYRAL DNPLWTAKYN LYNSSVNRVY GNMTLSYDVT PWLNFTARGG
INTYSETRKN VLRPGGSFSP LGSVSRTDLT NTEIDFTFLA TAQHDFSEKI NAKLLVGFNP
NQRTYTESSV SGAPVIDPNI LTIGGTLNQN AADYRSQRRL YGIFSELTLG YGNFLFLTAS
VRNDQSSTLP AKNNSYYYPA VSGSFVFSDV LNLPKNIINL GKLRANYAKV GKDASPYQVF
TAYNLGRTFY NGTAISTANL PSQLNNVNLK PEFTSEVELG TELQFFNSRI GIDAAYFDRV
STDLIVTREL PRTSGFATEI TNAGKISNKG WEIGLTLVPL RMANGLTWTS YFAYTSIKSK
VEDAGPGGEI FIGGTGLSSL GTIFRNGLPY GQIIGSKNAR DDAGNLLINP STGLPIRAAK
SDIIGDPNTK YQVGWTNTVN FKNFSLSVLM DYKAGGSLFS STAASLLLRG QLKNSEDREG
MRVIPGVLGD PATYKPLVGD NGQPIKNTIA MSAFQYHFTD GYGAYGADEV NIYDATVVRL
REVSLGYSVP KAFLKRYAKV FGSMRLSVSG RNLWFYAPNM LKGLNFDPEV LSNFADSNIQ
GFDLGASPST RRFGINLNAS F