Gene Slin_4616 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4616 
Symbol 
ID8728380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5606549 
End bp5608972 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389393 
Protein GI284039463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.413807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAA ACTATTTCAC CACCGCCCTG CGTGTCTTAA GACGCAACTG GAATTATACA 
ATCATCAACG TTGTCGGATT GACGTTCGGA CTGGCGTGCT GCCTTGTTCT TTTTCTGGCG
ATTCGTTACG AACTGAGCTA CGACCGGCAC CACGCCAACG CCGACCGGAC CTACCGGATC
ATTACATACA ACCGAAATTC GGGTGGCGAT GGCCGCAATA CGGGCATTCC CTTACCGGCA
CTGGCGGCAC TGCGAAACGA CTTCCCGGAG TTGACGCATC AGGTGACGAT GGCTTATGAA
CTCTACGGCG GGCTCGTACG TGTGCAGGAT CGGCAGGGGA ACAAGTTTCA GGAAAAAGAA
GGGGTCATCG CCTTTGTCGA ACCCGAATAC TTCCGGCTGT TTGACTACCA ATGGGCAAAT
GGCAGCCCCA AAACGGCGGT CAGCAACCCG CAAACGGCGG TACTGTCGGA GCAGATGGCC
CAGAAGTATT TTGGCAATGT CGACCCAATA GGCAAAACCA TTCGTATCGA CAACAAAGTC
GATTTTGTGG TGACGGGCGT TGTTCAGAAT CCACCAACTA CGAGCAGCCT GCCGTTTGAG
GTTATGCTGT CGTTCCCTTC GCTCAAACAA TACGGGACGA ATGGCGGCTG GGACGACTGG
CAGTCGAACT ACAGCGGTGC CCAGATTTAC ATGGCCTTAC CGGCAAACGT GACATCGGCG
CAGATGGAAC GTAAGCTCGT GCCGTTCCTT CAAAAATACA TGCGCCCGGA GGATGCCAAA
GACCTTCAAT ACGAGTTGCA ACCCCTCACC AACATTCACT TCGATACCCG CACGGGCAAC
TCAGCCAACC GAACAGTGAG CAAACAGATG ATTTGGGCGA TGGCTCTGAT CGGGCTGTTC
GTTCTGATAA CGGCCTGTGT TAATTTCATC AATCTGGCTA CGGCCCAGGC CATTCGTCGG
GCCAAAGAGG TGGGCGTCCG GAAAGTACTT GGTAGTTCGC GGACGCAGCT GGTCCGGCAG
TTTTTAAGCG AAACGGGCCT ATTGACGGGT CTGGCCATCG TACTGGCTTT TGTAGTCGCC
AATCTGTCGA TGCCATATGT GTCGGAGCTG CTCGACATCA ATGCAAAGTC GTTAACGCTA
TTCGACCCCG GCGTTGTGTC GTTTGTACTC GTACTAGCGC TGCTGACCAC GGTGCTGGCG
GGGTTTTATC CGGCCCTGGT GCTGTCGGGC TACCAGCCGG TGCTGGCCTT ACGCGGCAAG
ATGCGGATGG CGGGCAGCAG CCAGCTTACG CTGCGCCGGG GCCTGATCGT ATTTCAGTTT
GCCATTTCGC AGGTGCTCTT GATCGGCACC ATCATCGCTT ACAGCCAGAT GAAGTACGTC
CGCACGGCTG ACCTGGGCTA CAACAAAGAC GCCGTGCTCA CGGTCAACAT CCCCGACCGG
AAGCCGGGCC AGCTGGAAGC CCTGCGCGCT AAACTGGTTG GGTTGCCCAA CGTAAAATCA
CTGAGCTACG GCATCTCCAT TCCGTCGTCG GATGGCAACT GGTGGAGCGG CTTCCGCTAT
GAAAACGCTG ACAAAGACGC CGATTTTAGT ATCGTCATGC GCTTTGCCGA TACATCGTAT
ATCAACACCT ATGGTCTGAA ACTCATTGCC GGGCGCATGT ACCAACCCGC CGACACCGCC
CGGGATATGG TTGTAAACGA GTCGTTTGTC AAGAAAATTG GCCTGCACGA CCCGAAACAG
ATTCTCGGCA AACGCATCAG GATTGGGGCT AATAGCCCTC AAAAGGAGAT CGTGGGGGTT
GTTCGCGACT TCAATACCTT CTCGCTCCAT CAGGAAACCA ACGCCTGTGT ACTCACCAAC
CGCCGGGACG CTTATCATTC GCTCGGCATC AAACTCTCGA CCGGGCAGGG CAGCACCGAA
GCCATTCATA GCCTGATTGG CGAGGTAGAA ACGGCCTGGA ACGCCACCTT CCCGGACTTT
GTCTTCAAGT ATGAATTTCT GGATCAGGCC CTCAACAGCT TTTACAAGAG CGAAGAGCGC
ATGTACGCCC TGTTCCGGCT GCTGGCGGGT ATCGCCATTT TTATCGGCTG TCTGGGTCTC
TACGGCGTGG TAGCCTTCAT GGCCGAAGCC CGCACCAAAG AAGTGGGCAT CCGCAAAGTG
CTGGGGGCAT CGGTTGGCAA TATTGTGAGC CTGTTCTCCA CCGATTTTGT AAAGCTGGTA
TTCATCGCGT TGGTCGTTGC CTCGCCGATA GCCTGGTATG TCATGGGCAA ATGGCTCGCC
GATTTCCCGT ACAAGATCGA TATCGAGTGG TGGATGTTTG CCCTGGCGGG TGTGCTAGCC
ACCGGCATTG CCCTATTGAC GATTAGTTTC CAGAGTGTAA AAGCGGCCCT GATGAACCCG
GTGAAATCGT TGAAAAGCGA GTAG
 
Protein sequence
MLKNYFTTAL RVLRRNWNYT IINVVGLTFG LACCLVLFLA IRYELSYDRH HANADRTYRI 
ITYNRNSGGD GRNTGIPLPA LAALRNDFPE LTHQVTMAYE LYGGLVRVQD RQGNKFQEKE
GVIAFVEPEY FRLFDYQWAN GSPKTAVSNP QTAVLSEQMA QKYFGNVDPI GKTIRIDNKV
DFVVTGVVQN PPTTSSLPFE VMLSFPSLKQ YGTNGGWDDW QSNYSGAQIY MALPANVTSA
QMERKLVPFL QKYMRPEDAK DLQYELQPLT NIHFDTRTGN SANRTVSKQM IWAMALIGLF
VLITACVNFI NLATAQAIRR AKEVGVRKVL GSSRTQLVRQ FLSETGLLTG LAIVLAFVVA
NLSMPYVSEL LDINAKSLTL FDPGVVSFVL VLALLTTVLA GFYPALVLSG YQPVLALRGK
MRMAGSSQLT LRRGLIVFQF AISQVLLIGT IIAYSQMKYV RTADLGYNKD AVLTVNIPDR
KPGQLEALRA KLVGLPNVKS LSYGISIPSS DGNWWSGFRY ENADKDADFS IVMRFADTSY
INTYGLKLIA GRMYQPADTA RDMVVNESFV KKIGLHDPKQ ILGKRIRIGA NSPQKEIVGV
VRDFNTFSLH QETNACVLTN RRDAYHSLGI KLSTGQGSTE AIHSLIGEVE TAWNATFPDF
VFKYEFLDQA LNSFYKSEER MYALFRLLAG IAIFIGCLGL YGVVAFMAEA RTKEVGIRKV
LGASVGNIVS LFSTDFVKLV FIALVVASPI AWYVMGKWLA DFPYKIDIEW WMFALAGVLA
TGIALLTISF QSVKAALMNP VKSLKSE