Gene Slin_3764 details

Gene Information

Locus tag: Slin_3764
Symbol:
ID: 8727522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4527273
End bp: 4529417
Gene Length: 2145 bp
Protein Length: 714 aa
Translation table: 11
GC content: 48%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003388559
Protein GI: 284038629
COG category:
COG ID:
TIGRFAM ID:
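
The start, end, and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino-acid count of an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4527273, 4529417)
print(length_bp)                  # 2145
print(protein_length(length_bp))  # 714
```

Both results agree with the Gene Length and Protein Length values listed in the record.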


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCAGA CGATGATTGG TAGCTACCTT AAAACCACCA CTCGTAATAT ACTTCGACAT 
AAGCTTTTTG CAACCATCAA TATCATTGGC TTAGCCATCG GTTTGGCGGT GGGGCTACTG
GTCATTACCT TAGTTCATGA CTTGTTCTCC TATGATCGCT TCCATCAAAA GCGAGATCGC
ATTTACCGCA TCATCACAAG TCGGCAAGAT GCACAGCTTG GCAATCAGGA CTATGCTTCA
GCATCGGTCA AAGTAGGGCA AATCTGCCAA CAGCAGATGC CTGGTATCGA AGAGACAGCT
ATCGTACGGC AGGGATATAG GGGCGATGCA ACGGTTCAGC AAACGACCCT TCCCATAAGT
GGCCTTTGGG CAACTCCTTC ATTTCTGTCC GTTTTCACCT TTCCACTGCT GAGAGGTAAC
CCCATGACGG CCCTCAAAGA GCCTTATTCC CTTGTTATTA CCCAGAAACA GGCCCAAAAG
TTATTTGGCG CGGTAGATCC CGTAAACCAG GTTATCCGAC TGGATTCGAC CAACTACAAG
GTGACCGGAG TTCTTCAAGA CATACCGCTG TTCTCTCATA TTCAGTTTGA CGCCTTAATC
TCCTGGTCAA CCCTCCAACA GGCTCGACAG AACGATCCTA ATTTTTTCAG CTGGGACAAC
ATAGCCGATA CGTACGTTTA TTTGTTGTTA CCTAAAAATG GAGATGCCAG CGTTGTGCAA
CGCCAGCTGG ACCAGCTTGA CCAGGTAGAA AATGCCGTCA TTAAACCTAG GGCAATGAGT
ACCCGTTTGC AGCCCTTAAT GAGCATTTTT CTAGGAAAGG ATCTGCGTAA CGAAATTGGT
CATAGTCTCC CCCTTAGTAC CCTGTGGGCC CTACTCGCCT TCGCCTTTAT TGCCATCTTA
ACCGCTTGTT TCAACTACAC TAACCTGTCG ATCGCTCGTT CACTCAGACG CGCACGGGAA
GTGGGCGTTC GCAAAGTGCT GGGTGCCGTT AAAGGGCAAG TGCTCGCCCA ATTTATTGTT
GAGGCCATTG TTACGGCCTT ACTAGCCATG CTACTCGCTT TTGAGTTCTT TCTGGGATTA
CGTTCCGCTT TCCTAGCCTT GGGACCGAGC TTTGCTACCC TGCAACTCTC ATGGGGGGTT
GTACTTGCCT TTGGGCTACT GGCCATCTTA GTAGGAATTC TAGCTGGGTT AGTACCGGCT
ATATCCTTAA CGAAAATTAA TCCACTACAG GTTCTCAAGA ACCTGCAATC CATAACCCTG
TTTCGGCACG TTACATTACG CAAGGCATTG ATCTTTAGTC AGTACACGTT CTCTCTATTT
TTTGTGGCCG TCACGCTCAT TCTCTATCAA CAATACCAGT TTTTTATCCG TCAGGATGTG
GGCTTTCAGA CCGATCATAT CCTCAACATT GCTTTACAAA ACCAGTCCGC TGAAAGCCTC
AAGCAGAAAC TATCCCAGAT TCCCCAGGTG CAACAGATTT CACAATCTCA GCGGATCACC
AGTTTGGGAG CCACTTATCA AACGTATTTA CGCTATAAAG ACCCGCAAGA TTCCCTGGCC
GCCAAGCTGA ATGGTATTGA TCAACAGTAT CTATCGCTTC ATCACTATAA ACTGTTAGCA
GGCCGATATT TTACGCCGGC ACAAACGGAT TCTGCTTCAA ATGAAATCTT GATCAACCAA
GAGTTGATGA AGCATTTTAA CTTAGGCAAT GGCAACCCTC AAAAAGCCGT AGGTTCCCTT
TTGACCACTG GGGAAAAGGC CTACCAGGTG GTAGGTGTGC TGAGCGATTT TCACTACAAT
TCGCTCTATG AGAAAATGGA ACCAGCCTTT TTCCGCTATG CCCCTAAGGA TGCCAGCTAC
CTGAATGTAA AAGTCGCTTC GGGAAAGGAG CCGGCGACCA TCGCCCGCAT TCGCCAAGCT
TGGAAGCAGG TGGATACCGT TCATCCGTTC GTGGCCAGTT GGTATGAGGA TGACATTGAG
GAATTTTATC ACCCGCTTTC GGTCATCAGT AAGCTAATTG GAAGCCTGGC CTTCTTAACC
ATTTTTATTG CGTCCCTGGG CTTGTTTGGG ATGGTTGTCT ACACGGCCGA AACCCGGTTA
AAAGAAATCA GTATTCGGAA GGCGTATGCG ACTGATAGAA TTTGA
 
Protein sequence
MAQTMIGSYL KTTTRNILRH KLFATINIIG LAIGLAVGLL VITLVHDLFS YDRFHQKRDR 
IYRIITSRQD AQLGNQDYAS ASVKVGQICQ QQMPGIEETA IVRQGYRGDA TVQQTTLPIS
GLWATPSFLS VFTFPLLRGN PMTALKEPYS LVITQKQAQK LFGAVDPVNQ VIRLDSTNYK
VTGVLQDIPL FSHIQFDALI SWSTLQQARQ NDPNFFSWDN IADTYVYLLL PKNGDASVVQ
RQLDQLDQVE NAVIKPRAMS TRLQPLMSIF LGKDLRNEIG HSLPLSTLWA LLAFAFIAIL
TACFNYTNLS IARSLRRARE VGVRKVLGAV KGQVLAQFIV EAIVTALLAM LLAFEFFLGL
RSAFLALGPS FATLQLSWGV VLAFGLLAIL VGILAGLVPA ISLTKINPLQ VLKNLQSITL
FRHVTLRKAL IFSQYTFSLF FVAVTLILYQ QYQFFIRQDV GFQTDHILNI ALQNQSAESL
KQKLSQIPQV QQISQSQRIT SLGATYQTYL RYKDPQDSLA AKLNGIDQQY LSLHHYKLLA
GRYFTPAQTD SASNEILINQ ELMKHFNLGN GNPQKAVGSL LTTGEKAYQV VGVLSDFHYN
SLYEKMEPAF FRYAPKDASY LNVKVASGKE PATIARIRQA WKQVDTVHPF VASWYEDDIE
EFYHPLSVIS KLIGSLAFLT IFIASLGLFG MVVYTAETRL KEISIRKAYA TDRI
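
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and relies on the fact that translation table 11 makes the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCTCAGA CGATGATTGG TAGCTACCTT"))  # MAQTMIGSYL
```

The output matches the first ten residues of the protein sequence listed above.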