Gene Slin_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0021 
Symbol 
ID8723749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp19274 
End bp22204 
Gene Length2931 bp 
Protein Length976 aa 
Translation table11 
GC content51% 
IMG OID 
ProductTonB-dependent receptor 
Protein accessionYP_003384894 
Protein GI284034964 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCATA ATTTGGAGGA AAAACCAGTG GCTGAAATAA CCGTTAGCGG ACGGGTTACG 
GATGCCACTA CCAACGAAGC CCTCGCGGGT TGTAACGTTG TACTGAAAGG TACACAGAAA
GGAACAACGA CGGATGCCAA TGGCGATTAT AAAATTGTAG TGCCCGATGG TAATGCCACA
CTGGTGTTCG GGTTTATCGG TTTTATTTCT CAGGACGTAC CCGTAGGAAA CCGCACGGTC
ATCAACGTAT CGCTGAAAGC GTCGGCTTCG GAGCTGGCGC AGGTCGTTGT TATTGGTTAC
GGTACTACCA CAAAGAAAGA CGTAACCGGA TCGCTCAAAA CAATCAAGAG TACAGATTTT
AACCGGGGTA TCATCAACTC ACCTGAGCAG CTTTTGCAGG GTAAAGTAGC GGGCGTAAAT
GTTACCTCAG CCAGTGGTGA GCCGGGGGGT GTTCAAAATA TTACGGTACG TGGGCCGGGG
GGTGTTCGGA CAGGTAGTAC GCCACTCTTC GTACTGGACG GTATTGCGCT CGATAACTCA
AGTACGGGTG GTGCAACTAA CCCATTAAAT TTTCTGAACC CACAGGATAT CGAAGCCATC
GATGTTCTGA AAGATGCCTC TGCAACAGCT ATTTATGGTG CACGGGGTGC TAACGGCGTA
ATTCTGATCA CGACCAAGAA AGGCAAAGCT GGTGCAACTA ACCTTACTCT TTCCTCAAAC
ATAGGGATTT CGAACATGGC CCGCCCCATT GCGCTGTTTT CAACGGATGA GTACAAGCAG
CAGGTAGCGG CTGTAGGCGG TGTGGTCGAC GATCAGAAAG GATCTACGGA TTGGCAGCGT
GAAATCAGCC GGACGGCGGT TACCCAAAAT CATAATCTGT CGTTCGGTGG CGGTGCCGAC
CGCCTGACCT ATTATGGCTC TATTGGCGTG CAGGACCAGC AGGGTATCCT GAAAAATAGC
AGCCTAAAAC GGTACACCGC CCGTTTCAAC GCTTCACAGA AATTTCTGGA AAATCGACTG
GTGCTGGATG TCAACATGAC GGCCTCGCAA ACGATCAACG AGCGTCCGCC AATCGAAGGA
ATAATCGGAG CGGCTCTGTC GGCCAATCCA ACGTATCCGG CGCGCGATGC CAATGGCAAT
CCAGCCCGCT ATCAGGCCTT CACCAACCCA TTGCTGGCAT TGAATCTGAA CAAGGACCTG
ACAACCATCA ACCGGGTTGT GGCGTCGGTA TCACCATCGT TCAGCATCAC CAAAAACCTG
GTTTACAAGC TGAATCTGGG CGTTGATAAT TCAAGCTCCA CGCGCGACCA GCAGTCTTAC
GCTAGTACGG TGCCGCAGCA GGACGGCCGC CTGGATGCTA CCTACCTTAA CAACCGGAAC
GTACTGGTCG AAAACTATTT CACCTACACC AAAACTTCGG GCGATCATAA CCTGACCGCT
TTGCTGGGGC ATTCGTATCA GAAGTTTACG ATTCAGGGGC GTAACTGGAG CATCAACAAA
TTTCCAATAT CACCCATTGA ACCCGCCAAC AACCCTGGCC TGGGGCAAGA CCTGACGCTG
GCCAACAACC GTCCCGGCGG CTATGCGATC ATCAATGAGT TACAGTCGTT CTTCTCACGG
GTTAATTACG CCTATAAAGA TCGCTACCTG TTTACGGCTA CGGTTCGTGC CGATGGGTCG
AGCAAATTTG GCGCAAACAA CAAGTATGGT GTATTCCCTT CGTTCTCGGG CGGCTGGCGG
CTGTCGGAAG AAGGGTTCCT TAAATCGGGA CCATTCTCGG ATCTGAAACT CCGCGCCGGT
TGGGGACAAA CGGGTAACCA GGAAATACCG TCGAAGATTA CCCAGGCCCT GTTCACCTCG
AACGTGTCGG CCTCAACCAG TTACCCGCTC GATGGGTCAA CCAACTATCC GGCGGGAACC
ACGTATACCC GTCTGGCCAA TCCTGACATT CAATGGGAAG TATCGACCCA AACCGACCTG
GGCCTTGATT TTGGTCTGTT CCGGGGGGCG TTGACGGGTT CCGTCGATTA TTTCCATAAG
ACATCGGGTA AGATTCTGCT CGAAGTGATT CCTTCCGATC CTATTCAGCC CGCTTCCACC
TACTGGACCA ACGTGCCGAA TATGACCATC ACCAACCAGG GACTTGAGCT TGATCTGAAC
TACCGTTACG CCAGCACAAG CGGCTTCCGG TTCGACATAG GTGGCAATGT TACGTTCATT
AAAAATGTAG TGAATAATTC GCCATACACG GTTATTACCT CAGGCTCCGC ATCGGGAGCC
GGGTTGACAT CGGCTACGGT AAACGGCTAT GTGAACGGAC AGCCCATCGG AACGTTCTTC
CTGCGGGAAT ACCTGGGCGT TGACGACAAA GGGGTTAACC GATTCAGTGA CATAGACGGT
GATGGAATCG GTGGTACCGA CAAAGACCGG ATTGCTGCGG GAAGCGCCTT GCCAACCCGC
CAGTTTAACC TCAATTTCAG TACGGCTTAC AAAGGTTTCG ACCTAACGGC CAATTTTAAC
GGCGTGTCGG GTAATAAAAT TTACGACAAC ACGACGAATG CGTTCTTCTA CAAAGCACGT
CTGGTAAAAG GGCTGAATGG ACCCGCTGAA TCAATTGGTG AGCCAACCGA GTCAATCAAT
AACCCGGCTC CTGTATCGAC ACGCTTCCTG AGAGACGGCG CTTTCTTCCG GCTCAATAAC
CTGTCGCTGG GCTACAATCT AAATCCCCGT ACCCTTGGTA TGAATCGCTG GATTTCAAAC
ATCCGACTAT CGGTAACGGG TCAGAACTTG TTTGTTATCA CGAAATACAA AGGGTATGAT
CCTGAAGTAA ACATCGACCG CACGGTTAAT GGTATCTCGT CGTATGGAAT CGACTACCTC
AGTTATCCTA AAGCGCGTTC GTTTGTGTTT GGCTTAAATC TTACCTTCTA A
 
Protein sequence
MAHNLEEKPV AEITVSGRVT DATTNEALAG CNVVLKGTQK GTTTDANGDY KIVVPDGNAT 
LVFGFIGFIS QDVPVGNRTV INVSLKASAS ELAQVVVIGY GTTTKKDVTG SLKTIKSTDF
NRGIINSPEQ LLQGKVAGVN VTSASGEPGG VQNITVRGPG GVRTGSTPLF VLDGIALDNS
STGGATNPLN FLNPQDIEAI DVLKDASATA IYGARGANGV ILITTKKGKA GATNLTLSSN
IGISNMARPI ALFSTDEYKQ QVAAVGGVVD DQKGSTDWQR EISRTAVTQN HNLSFGGGAD
RLTYYGSIGV QDQQGILKNS SLKRYTARFN ASQKFLENRL VLDVNMTASQ TINERPPIEG
IIGAALSANP TYPARDANGN PARYQAFTNP LLALNLNKDL TTINRVVASV SPSFSITKNL
VYKLNLGVDN SSSTRDQQSY ASTVPQQDGR LDATYLNNRN VLVENYFTYT KTSGDHNLTA
LLGHSYQKFT IQGRNWSINK FPISPIEPAN NPGLGQDLTL ANNRPGGYAI INELQSFFSR
VNYAYKDRYL FTATVRADGS SKFGANNKYG VFPSFSGGWR LSEEGFLKSG PFSDLKLRAG
WGQTGNQEIP SKITQALFTS NVSASTSYPL DGSTNYPAGT TYTRLANPDI QWEVSTQTDL
GLDFGLFRGA LTGSVDYFHK TSGKILLEVI PSDPIQPAST YWTNVPNMTI TNQGLELDLN
YRYASTSGFR FDIGGNVTFI KNVVNNSPYT VITSGSASGA GLTSATVNGY VNGQPIGTFF
LREYLGVDDK GVNRFSDIDG DGIGGTDKDR IAAGSALPTR QFNLNFSTAY KGFDLTANFN
GVSGNKIYDN TTNAFFYKAR LVKGLNGPAE SIGEPTESIN NPAPVSTRFL RDGAFFRLNN
LSLGYNLNPR TLGMNRWISN IRLSVTGQNL FVITKYKGYD PEVNIDRTVN GISSYGIDYL
SYPKARSFVF GLNLTF