Gene Slin_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1839 
Symbol 
ID8725576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2221608 
End bp2222858 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content49% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003386683 
Protein GI284036753 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.877105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.633197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATAC GAATCGCTTC TCAAATAGCC CAAACGCACC TGCTGGCTAA GAAACGTCAG 
ACACTTGTCG CTATGCTGGG GGTCACCTTC GGCATTGCAA TGTTCATTAC CATGATTTCG
TTTATGCAGG GCGTTAACCA GTTTCTGGAA GATTCGGCGC TGGATGCCAG TCCGCATATT
CGGATGTATA ACGAGGTAAA CACCCAGCGG CCGGGCCTGA TCGAACAGCT CAACCCCGGC
AAGTTTAACG TCATTTACCA CCAAAAGCCG AAGGATGAGC AGTCGCGTAT CAAAAACGGG
ATGACCATTG CCGGACGCAT TGAACGGGCA TCGGGTGTGT TGGGCGTTTC GCCCCAGGTG
GCTACGCAGG CATTTTATAA TAACGGCCCA ATTCAGATTT CCGGGACAAT TTCGGGAGTG
GATATCGACC GCGAAAACCG GCTCTATAAA CTAACGACCC GGCTTAAATC GGGTAGCCTG
AACGCGCTGA AAACCAATCC CGACGGCCTA ATCATGGGGG CCGTACTAGC CGACAAGCTC
AACGTTCGGG TAGGTGATAA AGTGACGGTA ACAACACCCA GAGGAGGCAT CAGAACCCTG
CGGGTGGTGG GCACATTTGG CTTTGGCATT GGTACAATCG ACAATACCAA GAGCTACGGA
AATCTCTCTA CAGTTCAGGA AATGCTGCAA CGCGACCCCA GCTACATTAC CGACATTCAT
ATTAAAATGT TCGACCCCTT ACAGGCAATA CCTTTTGGGA AACAGCTGCG GGCCATTTAT
TGGTACTATA CCGAAGACTG GGCAACGGCC AACACGGCCA TACTGGCGGG TGAAAAGATC
CGAAATATGC TGACTTATGT AGTGTCGTTC ACGCTGCTGG TAGTTGCGGG CTTCGGTATT
TACAACATCA TGAATATGAC CGTTATCAAC AAAATCAAGG ACATCGCCAT TTTGAAAGCC
ACCGGTTTTG AGGGTCGCGA CATCATCGCT ATTTTTCTCT TTCAAGCTGT TTTCATTGGT
GTTTCGGGTG GCCTGTTAGG GCTGGGGATC GGTTTCGGGC TCAGTTATCT GCTGTCAATC
ACCCCATTCG ATGCCGGTGG GTTCATCAGT ATTAAAACAT TCCCGGTCAT TTTCGAGCCA
AAGTATTATA TAATGGGGCT GTTATTCGGT GTGATAACCA CTGTTCTGGC AGGGTATTTC
CCTTCCCGAA AAGCCTCTCA AGTTGACCCC GTTTCCATTT TAAGAGGATA A
 
Protein sequence
MDIRIASQIA QTHLLAKKRQ TLVAMLGVTF GIAMFITMIS FMQGVNQFLE DSALDASPHI 
RMYNEVNTQR PGLIEQLNPG KFNVIYHQKP KDEQSRIKNG MTIAGRIERA SGVLGVSPQV
ATQAFYNNGP IQISGTISGV DIDRENRLYK LTTRLKSGSL NALKTNPDGL IMGAVLADKL
NVRVGDKVTV TTPRGGIRTL RVVGTFGFGI GTIDNTKSYG NLSTVQEMLQ RDPSYITDIH
IKMFDPLQAI PFGKQLRAIY WYYTEDWATA NTAILAGEKI RNMLTYVVSF TLLVVAGFGI
YNIMNMTVIN KIKDIAILKA TGFEGRDIIA IFLFQAVFIG VSGGLLGLGI GFGLSYLLSI
TPFDAGGFIS IKTFPVIFEP KYYIMGLLFG VITTVLAGYF PSRKASQVDP VSILRG