Gene Slin_1804 details


Gene Information

Locus tag: Slin_1804
Symbol:
ID: 8725541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2179333
End bp: 2180607
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: Xylose isomerase domain protein TIM barrel
Protein accession: YP_003386648
Protein GI: 284036718
COG category:
COG ID:
TIGRFAM ID:
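
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2179333
END_BP = 2180607
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Codon count minus one stop codon; assumes an unspliced CDS."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span works out to 1275 bp and the expected protein length to 424 aa, matching the listed values.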


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTCC AACCCGAACA CATCGCCGAT TATAACCAGA CACTACTGGA AGGTCACCAG 
CGTAAACTGT CCTACTGGAC GCAGGAGGTG GCCAAGGCGG AACCCATTAT TCAGAAACTA
ATTGATTTTC AGATTGCTAT CCCGAGCTGG GCATTGGGAA CGGGCGGCAC TCGTTTCGGC
CGGTTTCCGG GTGGGGGAGA GCCTCGCAGC CTGGAAGAGA AAATTGCCGA TGTAGGCCTG
CTGCACGCCC TGAACCGCGC CAGCGGTGCC ATATCCCTGC ACATCCCCTG GGATATTCCA
ACCGACCCGG AAGCGATCAA ACAACTGGCG GCCCAGCATG GCCTGCGGTT TGATGCCGTT
AACTCCAACA CCTTTCAGGA TCAGCCCGAC GCCGAATTGA GCTATAAGTT CGGATCGCTT
CAGCATACCG ACCCCGCCGT TCGTCGGCAG GCTATCGACC ATAACATTGA GGTTATCCGA
CACGGTGTGG CGCTGGGCTC TACGGCCCTG ACCGTCTGGT TGTCGGATGG CTCCTGTTTC
CCCGGTCAGC TTAATTTCCG GAAAGCCTTC GACCGCACAC TGTCGAGTCT GGAAGAAATC
TACGCGGCTC TCCCCGACGA CTGGAAAGTG TTTGTCGAGT ACAAAGCGTT CGAGCCCAAT
TTTTATTCGA TGACCGTTGG CGATTGGGGC AGCAGCTTCC TCTACGCGAA CAAACTCGGC
CCGAAAGCGT ACACGCTGGT CGATCTGGGT CACCACCTGC CCAATGCCAA CATCGAGCAG
ATTGTCGCGA TTCTTCTCCA CCAGGGTAAA CTGGGTGGTT TCCATTTCAA CGACTCCAAA
TACGGTGACG ACGACCTGAC GGCGGGCAGC ATCAAACCGT ATCAGCTTTT CCTGATCTTT
ACCGAACTGG TCGATGGTCT GGATGCCAAA GGGATGAACC ACGCTACCGA CCTCGGTTGG
ATGATCGACG CCAGCCATAA CGTAAAAGAT CCGCTGGAAG ACCTGCTTCA GTCGGTCGAA
GCCATACAGC TGGCGTACGC TCAGGCCCTA TTAGTTGACC GCGAAGCGCT GGAAGAAGCC
CGCGAGGCCA ACGATGCCGT TCGGGCACAG GAGATTTTAC AGGATGCTTT CCGCACGGAT
GTTCGGCCGC TGGTGGCCGA AGCGCGTCGG CGGGCCGGTG CATCGCTGAA TCCACTTGGC
TTATACCGGC AGCTGGACGT TCGCCAGCAA CTGATCGGTG AGCGTGGTGC CAAAACAGTT
GCCACGGGAC TTTAA
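
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before any calculation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) clean a block-formatted sequence and compute its GC fraction; pasting in the full sequence is one way to compare against the GC content field, assuming that field is the plain fraction of G and C bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, after cleaning."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence in this record, used only as sample input.
FIRST_LINE = "ATGAATCTCC AACCCGAACA CATCGCCGAT TATAACCAGA CACTACTGGA AGGTCACCAG"

if __name__ == "__main__":
    print("bases on first line:", len(clean_sequence(FIRST_LINE)))
    print("GC fraction of first line: %.2f" % gc_fraction(FIRST_LINE))
```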
 
Protein sequence
MNLQPEHIAD YNQTLLEGHQ RKLSYWTQEV AKAEPIIQKL IDFQIAIPSW ALGTGGTRFG 
RFPGGGEPRS LEEKIADVGL LHALNRASGA ISLHIPWDIP TDPEAIKQLA AQHGLRFDAV
NSNTFQDQPD AELSYKFGSL QHTDPAVRRQ AIDHNIEVIR HGVALGSTAL TVWLSDGSCF
PGQLNFRKAF DRTLSSLEEI YAALPDDWKV FVEYKAFEPN FYSMTVGDWG SSFLYANKLG
PKAYTLVDLG HHLPNANIEQ IVAILLHQGK LGGFHFNDSK YGDDDLTAGS IKPYQLFLIF
TELVDGLDAK GMNHATDLGW MIDASHNVKD PLEDLLQSVE AIQLAYAQAL LVDREALEEA
REANDAVRAQ EILQDAFRTD VRPLVAEARR RAGASLNPLG LYRQLDVRQQ LIGERGAKTV
ATGL
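
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hand-rolled translator rather than the database's own tooling: it builds the codon table from the conventional TCAG ordering, whose amino acid assignments are the same for translation table 11 as for the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` function name is illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...),
# paired with the amino acids of the standard code; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = (
        "ATGAATCTCC AACCCGAACA CATCGCCGAT "
        "TATAACCAGA CACTACTGGA AGGTCACCAG"
    )
    print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields the first 20 residues of the protein sequence; the final 15 bases translate to the closing residues followed by the stop codon.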