Gene Slin_4224 details


Gene Information

Locus tag: Slin_4224
Symbol:
ID: 8727983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5089716
End bp: 5092907
Gene Length: 3192 bp
Protein Length: 1063 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003389008
Protein GI: 284039078
COG category:
COG ID:
TIGRFAM ID:
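
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 5089716
end_bp = 5092907
gene_length_bp = 3192
protein_length_aa = 1063

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # span matches the Gene Length field
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match Protein Length
```

Under these assumptions the three fields agree with one another.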


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.25512
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAGAA TTCTAATATT GAGTTTGCTG TTCATAGGCT CAATCTGGTC TACAGCGTGG 
GCTCAGGAAC GGAGAGTAGT CGGCAAGGTT ACGTCGGCAG AGGATGGGTC CGCTTTACCC
GGTGTTTCGG TGGTAGTAAA AGGATCGACG AAAGGGACAA CGACGGATGC CAGTGGTATT
TATAGTCTTA CGGTACCCAG TGGAAAAGGG ACAATTCTGG TATATAGCTT CGTCGGTGTT
ACGACGCAGG AAGTTAAACT CGGCAGTGAG TCGGAAGTGA ATGTAAGCCT GGTGTCGGAC
TCGCGTCAAC TGTCGGAAGT TGTGGTAACA GGGGTTGGGG TGGCTACCTC AAAAACCAAG
TTAGGTATTG CCGTGGAATC AGTATCGGCG AAAGACCTTC CCGCTGCACC AACGGCTTCT
ATCGACCAGG CACTGGTTGG TAAAATTGCC GGTGCTCAGA TCGTGAGTGC CAACGGTACA
CCTGGCTCAA AAGCCAACAT TCTGCTGCGC GGTATCAACA CAATTAACCG GGGTACATCG
CCAATTGTGT TGATGGATGG TGTGCAGGTG GGTTCTACCG ACATTAACAG CCTCGATCTT
AGTACCATTG AGCGGGTTGA AGTTATCCAG GGTGCGGCTG CCGGTACACT GTACGGAGCA
CAGGGTGCCA ATGGCGTTAT TCAATTGTTC AGCAAACGGG GTAAAGATGG CCCGGCGCAG
ATCAGCTTCT CGAACAGCTA TGCCAGCAAC ACCTACATCA ACAAAGGTGG TGTGGCCCAG
GCCGATAAAC ACGCCTTTGT AACGGATGCC AGCAACAATG TAATTGGTGT GTCCGGTAAG
CCCCTTGCTT TCGATCCGGC AACCAGCACC TGGAGCGAAA ACGTACAGTA CAATGCCCTG
GATGTAAACA GCCAGGCCAA CAAGCCCTAC GACCAGAACC TGAAGTTCTA CGATTATTAC
AAGATGTTTT TCCGGCCTTC GGAAACGATC AACAACTCGC TGAACATCTC GGGCGGAAGC
GGTAAAGCCG ATTACAGCAT CACGGCTTCC AATAGCTACC AGTCGACCAT TCTGAAAAAC
AATGGTGCTT TCAACCGGAG CAACCTGGTT AGTAACATTG GGATGACCCT TGCCAAGAAT
TTGACCTTGC GCAGCATATC TCAGCTGGTA TATACCAAGA ATACGATCAA CACCTACGAC
CGGGCTGTCT GGTACGATAT CAACAACACC CGTCCGTTCG ACAATTACGA TTATGTTGAT
CCCGACGGAA ACTATGCCGC TTATTTCGGC AGCGCGTCGG GCGTAAACGG GTACAACCCG
AACTACCGTC TGCAATACCG AAATCACGTT GATAACAAAG TAGACGTTAT TCAGAGCGTT
GAACTGGACT ACAAGCCCAT TAAATACCTG GACCTGAATG CCCGCTATGG TCTGAACTAC
ACCCAGGAAG GCGAACGCTA TACGTATGGC AACCAGACCC TGAACCGCAA CATCATTGCC
AACGGAGCCG GTTATGCTAC TTCGCTGAAC GCCAGTGACG CCAAAGGGGA GATTTCGACC
TACGATTACA AAACCGTATT CCAGAACTTC CTGGCCAGTG CCTTCATCAA AACGGATTTC
CAGGAAGATT TTAAGCTGAA TATTCCCATC CGTACATCAA CGCAAATCTC GTTCGACTAT
CGGAAAAACA ATTACAAGGA ATTCGATACC TATGCGCTGG GCGTACCTAC CTACAACCCG
TATACAGCCG CTCAGGCCAG TACCTACCGG GTTTCGCTTG ATAACAGCAC GCCATTCGTA
ACGTACGGAT ACCTGATCAA TCAGCACATC GAATACGGTG AACTGCTGGG TGTTACCGCC
GGTTTCCGTT CCGATTATTC ATCGGCATTC GGACGTGGGT CGACGCCGTT TACCTTCCCG
CACGCCGATG GATATATCCG TCCTTCGTCG CTTACGTTCT GGCAGAACAG CGCCCTGGGA
ACGTACGTGC CTGAGTTCAA ACTGCGGGCG GCTTACGGAC AGGCGGGTAT TCAGCCTAAG
CCGTTTGACC GGTACGTAAC GCTGGGTACG CGTACGTTTG GAGCTAACAA CGTCTTTTAT
AACACTGTTA CACAAAGTAA CCCGGATTTG GGCGTAGAGG TGTCGAAAGA ACTTGAACTG
GGAACGGACT TCACGATTAA AGGCGGCAAC GGCGATTGGC TCCGTAAGCT GAACTTCTCG
TTCAGCTACT GGGATCGTTC TACCGACAAT GCCATCTATA ACGTAAACTC GGCTCCGTCT
ACGGGTATCG GTACGGTGAA AGACAATGCC TTCTCGCTAT CTTCGAAAGG TACCCAGTTC
TCCCTGAACG CGACCGTTTA CCGGGGGCGT AGCTTTACCT GGAATTTCAC GACGAACTTT
GGCCATCAGA GTTCACAGAT CGATGCCGTA AAAGGAAATC AGCAGATCGT TGTGACATCC
AGCGCGGGTA GTACGAACTA TGTACTGAAA GCCGGTCAGA AAATTGGTCA GCTGTTCGGA
TTCCTGGCTA TTCATAGCCT TGATCAGGTG TTGCCCGATG GCAAGCCCGC CATTGCCGAA
AGCGCTAAAG CAAATTACGA AGTGGCCAGC AACGGCTACG TGGTCAACAA AACGACCAAG
CAGCCTCTGT TTAGCTCGGC TCAGTACAGC TTCGGCGATC CGAACCCTAC GTTTGTGTCG
TCGTTCATCA ACGATATTTC GTTCCGCGAC ATCGTAACAC TGAACTTCCA GTTCGACTGG
ACGCAGGGCA GCCACATTTA TAACCAGACG AAAGAGTGGA TGTACCGCGA TGGTATCCAC
AAGGATTACA CCAACCCAAT TACGATAAAT GGTCAAACGG GTGCCTGGAC GGCCTTCTAC
CGGGGTGTTT ATCAGGCGGG TGCCAACAAC GGAACGAAAG ATTACTTCTA CGAAGATGCT
TCGTTTGTAC GGCTTCGGAA CGTTGCGCTG GGTGTAGAGT TGACCAAGCT CATCAAGTTG
CCGATGCGCC GGTTACAGGT TGTCTTCAGC GGTCGTAACG TGCTCACCTT CACGAAGTAC
ACAGGATTCG ATCCTGAGGT AAGCTCCGGC CAGACAACGG GTAACGAAAG TTCGGCATGG
GATCGGGGTA CGGACCATAA CACGACGCCA AACAACCGTT CGTATCAGGT TTCTCTCAAT
TTTGGCTTCT AA
 
Protein sequence
MSRILILSLL FIGSIWSTAW AQERRVVGKV TSAEDGSALP GVSVVVKGST KGTTTDASGI 
YSLTVPSGKG TILVYSFVGV TTQEVKLGSE SEVNVSLVSD SRQLSEVVVT GVGVATSKTK
LGIAVESVSA KDLPAAPTAS IDQALVGKIA GAQIVSANGT PGSKANILLR GINTINRGTS
PIVLMDGVQV GSTDINSLDL STIERVEVIQ GAAAGTLYGA QGANGVIQLF SKRGKDGPAQ
ISFSNSYASN TYINKGGVAQ ADKHAFVTDA SNNVIGVSGK PLAFDPATST WSENVQYNAL
DVNSQANKPY DQNLKFYDYY KMFFRPSETI NNSLNISGGS GKADYSITAS NSYQSTILKN
NGAFNRSNLV SNIGMTLAKN LTLRSISQLV YTKNTINTYD RAVWYDINNT RPFDNYDYVD
PDGNYAAYFG SASGVNGYNP NYRLQYRNHV DNKVDVIQSV ELDYKPIKYL DLNARYGLNY
TQEGERYTYG NQTLNRNIIA NGAGYATSLN ASDAKGEIST YDYKTVFQNF LASAFIKTDF
QEDFKLNIPI RTSTQISFDY RKNNYKEFDT YALGVPTYNP YTAAQASTYR VSLDNSTPFV
TYGYLINQHI EYGELLGVTA GFRSDYSSAF GRGSTPFTFP HADGYIRPSS LTFWQNSALG
TYVPEFKLRA AYGQAGIQPK PFDRYVTLGT RTFGANNVFY NTVTQSNPDL GVEVSKELEL
GTDFTIKGGN GDWLRKLNFS FSYWDRSTDN AIYNVNSAPS TGIGTVKDNA FSLSSKGTQF
SLNATVYRGR SFTWNFTTNF GHQSSQIDAV KGNQQIVVTS SAGSTNYVLK AGQKIGQLFG
FLAIHSLDQV LPDGKPAIAE SAKANYEVAS NGYVVNKTTK QPLFSSAQYS FGDPNPTFVS
SFINDISFRD IVTLNFQFDW TQGSHIYNQT KEWMYRDGIH KDYTNPITIN GQTGAWTAFY
RGVYQAGANN GTKDYFYEDA SFVRLRNVAL GVELTKLIKL PMRRLQVVFS GRNVLTFTKY
TGFDPEVSSG QTTGNESSAW DRGTDHNTTP NNRSYQVSLN FGF
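
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first row of the gene sequence and comparing it with the start of the protein sequence. The sketch below is a minimal, hand-rolled translator, not the tool that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first two blocks of the protein sequence.
gene_prefix = "ATGAGTAGAA TTCTAATATT GAGTTTGCTG TTCATAGGCT CAATCTGGTC TACAGCGTGG"
protein_prefix = "MSRILILSLL FIGSIWSTAW"

print(translate(gene_prefix) == protein_prefix.replace(" ", ""))
```

The same function can be applied to the full gene sequence by pasting all rows into one string; whitespace between blocks is ignored.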