Gene Slin_5062 details

Gene Information

Locus tag: Slin_5062
Symbol:
ID: 8728827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6178484
End bp: 6181597
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: TonB-dependent receptor plug
Protein accession: YP_003389836
Protein GI: 284039906
COG category:
COG ID:
TIGRFAM ID:
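
The length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is a minimal Python illustration. It assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length_bp = gene_length(6178484, 6181597)
print(length_bp, protein_length(length_bp))  # prints: 3114 1037
```

Both results match the Gene Length and Protein Length fields of the record.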


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAATA AATCCTACAA AATTAGCCGA TCCGGTAACA ACGTAGGTCA GGTTGGAGCT 
TGCCAACCCC AGTCACTTAA ACAGAAGGCT TTTTCAGGCT TCCTGTCGAC GCTGGTTGTG
TTGTTGCTGT TGAGTAATTC AGCAGCATGG GCACAGGAGC GAACGGTGAC AGGTAAAGTA
ACCGACCCCA CAGGGTCCGT TTTGCCGGGT GTGAGTATTC AGGTGAAAGG CACCCAACGC
GGTACCAACA CAAACGGAGA AGGGGTCTAC ACGCTGACGA ATGTACCCGA CAACGCAACG
CTGGTGCTTA GTTTCATTGG CTACACAACG CAGGAAGTGG CTGTAGGCAA CCGCACAACG
GTTGATGTAC AACTGGCCGA CGACACCAAA GCGCTGCAGG AAGTAGTAGT TGTTGGATAT
GGCACGCAAC GCGCCAAAGA CGTTACCGGC TCGGTGGCGA CGATAGGCCC GAAAGATTTC
AACAAGGGTG TAATTGCCTC GCCGGAGCAG CTTTTGCAGG GCCGTGTGGC GGGTGTGCAG
ATTACGCCAG CCAGTGGTGA GCCGGGGGCG GCCAACAACA TCCAGATTCG GGGTGCTGTT
TCGCTACGCG GAGGTAACAC ACCTTTATAT GTTATCGACG GTGTTCCGCT CGATGGCGGT
GATTTTAGCA GTGGTACGCC GGATTTTGGT ACGGGTACGA CTACGGCCCG TAACCCACTA
TCGTTCCTGA ACCCGAGCGA TATTGAAAAC ATTTCGGTGT TAAAAGATGC GTCGGCTGCG
GCTATTTATG GAGCACGGGG TGCCAATGGT GTGGTGCTGA TTACGACCCG CAAGGGCCGT
GCGGGCGCAC CGCAGTTCAA CTTCTCGGCG TCGGGGGCTG TTTCTTCGTC GCTGAAGCGG
TACGATTTGC TGTCTCCGTC CGATTTTTTG GCTGGAGTGA AAGCCGCCGG TGGCGATCCG
ACCCTATCGA CGGTCAACGC GGGCGCTAAC ACTAACTGGC AAAATGAAAT CCTGCGTACC
AGCGTTTCGC AGATCTATAA TGCCAGCTTC GGTGGTGGTA CAAACGATAC GCGGTATTTG
TTCTCGCTGG GTTACCAGGA CCAGCAGGGC TTGGTAAAAG GCACGGGTCA GCAGCGGGTT
ACGGGACGTA TCAACGCGTC GCAGGATCTG TTCAACAAGA AGCTGACGCT GGCGGTTAAT
GCAACAACCT CGTCGGTAAC GGATCAGTAC GCTATGACAG GCAACCAGGC CGGTGCGCTG
GGTAACCTCT TCGGAGCCAT GATCGGGGCT AACCCAACAT ACCCGGTATT CAGAAATACG
AGCGATACAT CGTCGTATTA TCAGTTGGCG GGTGGTTCGT ACCGTAACCC ACGGGCTATG
CTCGATTATT ACCACGACCG GGGCGTAACT AACCGGACAC TGGCAAACGT CAGCGCTACC
TGGAACATCC TGACGGGGCT GTCGCTCAAG GCAAACTTTG GTGTTGATAA CTCGACGTCT
ACCCGATCAA CGTCGATCGA CTCACGCCTG AATGGTCAGT TTACCGTACC ACTGGGCTCG
GTAACCAACC AGGTATACGC CGACGCAACA ACGGGTCTGG GTGGTGCGGC TTACATCAAT
TCGCTGAACC GCCTGTCTAA ACTGGTCGAA TACACGGCTA ACTACAACCG GGATCTTGGC
CCTGGTAAAC TGGAGGCTGT AGCCGGTTTT GCGTACCAAA CATTTGGTAC CCGAACCAGT
TATGTAGCGG CCAGCCGTTT CCCGTTCGAT GAATCGGCCA TTTCGTATAC AGACAACATC
GGGGCGGCAA ACACCCTCAC CGGAACGGCT ATCGGGGGTG GCTCATCACG TGCTCAGAAT
GACTTACAGT CTTACTTTGC CCGCGCCAAT TATAACTTCA AAGAGAAGTA TTTGCTGACG
GCGACCGTAC GGGTAGATGG TTCATCACGT TTTGGGGTAA ACAACAAGTA TGGTACCTTC
CCATCGGTAG CGGGTGCATG GCGGATATCG CAGGAGAGCT TCATTCCGAA AAATATTTTT
GATGACCTCA AAATCCGGGC GAACTACGGT ATTGTAGGTA ATCAGGATTT CACCGGGGGC
GCGTCGAAAA TCATTTATAC CTACAACAGT TCCGGTTCGC AGATTCAGCA GAACAACCCG
AACCCTGATC TGAAGTGGGA GCAAAACACC ACAACCGGTG CGGGTATCGA CTTTAGCGTG
CTGAAAGGCC AGTTGTCCGG TTCGATTGAT TATTATCACC GTGCTGGTTC CAACACACTG
CTTCAGGTGT TCTATGCCCA GCCTGCGCCT GTAAACTACA AATGGATCAA CCTGCCCGGC
CAGATTGTGA GCCAGGGTAT TGAGCTGAAC TTGATTTATC AGGTTTTCCA GAAGCAACAG
TTTGGCTGGG AAGCCGTATT CAATCTGACC ACGCTTGATA TTAAGGCGCA GAACATCGGT
ACCGATCAGG CCGTGGGTGC TATCAGTGGT CAGGGCCTTT CGGGAGCTTA TGCCGAACGG
ATCACCAGTG GCTACGCGCC ATTCTCGTTC TTCATCCCCA AGTTTACGGG TTTCGATGCC
AACGGATACT CTACCTATGC GGATGATGGC CGGTCGACGT ATCAGGGCAG CCCATTTGCC
AAATTACGGT TAGGTTTGAC CAACAACTTC ACGTTTGGTG CCTGGACAGC GAGTCTGTTC
GTGAATGGTC AGTTTGGCGG CAAAATTTAC AACAACACGG CCAACGCCCT GTTTGCGAAA
GGTGCCCTCA AAAATGCCCG GAACGTAACG TATGACGTAG CGAACAGCAC TGAAAACGGC
TTGAACCCAG CATCGGTATC GACCCGGTTC CTGGAGAAAA GCGATTACGT TCGGGTTACA
AACCTGACCA TCTCGCGCCG GTTCGAACTG CCGCAGGGCG GATTTGCCAA GTCGCTTTCG
CTGTCGCTGA CCGGTCAGAA CCTGTTCATC TTTACGGGCT ATACCGGCCT GAATCCGGAT
GTAAATACGG TGACTTATAA CGGGAACGGA AACGGCATTC CATCGCTGGG TATCGACTAT
ACGCCGTATC CAACACCCCG CACAGTGACC CTAGGCTTAA ATGTTGGTTT CTAA
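
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage that can be compared against the GC content field of the record. It assumes the sequence text has been copied into a Python string. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input only; paste the full gene sequence to check the record's value.
print(gc_percent(clean_sequence("ATGC atgc\n")))  # prints: 50.0
```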
 
Protein sequence
MMNKSYKISR SGNNVGQVGA CQPQSLKQKA FSGFLSTLVV LLLLSNSAAW AQERTVTGKV 
TDPTGSVLPG VSIQVKGTQR GTNTNGEGVY TLTNVPDNAT LVLSFIGYTT QEVAVGNRTT
VDVQLADDTK ALQEVVVVGY GTQRAKDVTG SVATIGPKDF NKGVIASPEQ LLQGRVAGVQ
ITPASGEPGA ANNIQIRGAV SLRGGNTPLY VIDGVPLDGG DFSSGTPDFG TGTTTARNPL
SFLNPSDIEN ISVLKDASAA AIYGARGANG VVLITTRKGR AGAPQFNFSA SGAVSSSLKR
YDLLSPSDFL AGVKAAGGDP TLSTVNAGAN TNWQNEILRT SVSQIYNASF GGGTNDTRYL
FSLGYQDQQG LVKGTGQQRV TGRINASQDL FNKKLTLAVN ATTSSVTDQY AMTGNQAGAL
GNLFGAMIGA NPTYPVFRNT SDTSSYYQLA GGSYRNPRAM LDYYHDRGVT NRTLANVSAT
WNILTGLSLK ANFGVDNSTS TRSTSIDSRL NGQFTVPLGS VTNQVYADAT TGLGGAAYIN
SLNRLSKLVE YTANYNRDLG PGKLEAVAGF AYQTFGTRTS YVAASRFPFD ESAISYTDNI
GAANTLTGTA IGGGSSRAQN DLQSYFARAN YNFKEKYLLT ATVRVDGSSR FGVNNKYGTF
PSVAGAWRIS QESFIPKNIF DDLKIRANYG IVGNQDFTGG ASKIIYTYNS SGSQIQQNNP
NPDLKWEQNT TTGAGIDFSV LKGQLSGSID YYHRAGSNTL LQVFYAQPAP VNYKWINLPG
QIVSQGIELN LIYQVFQKQQ FGWEAVFNLT TLDIKAQNIG TDQAVGAISG QGLSGAYAER
ITSGYAPFSF FIPKFTGFDA NGYSTYADDG RSTYQGSPFA KLRLGLTNNF TFGAWTASLF
VNGQFGGKIY NNTANALFAK GALKNARNVT YDVANSTENG LNPASVSTRF LEKSDYVRVT
NLTISRRFEL PQGGFAKSLS LSLTGQNLFI FTGYTGLNPD VNTVTYNGNG NGIPSLGIDY
TPYPTPRTVT LGLNVGF