Gene Teth514_1115 details


Gene Information

Locus tag: Teth514_1115
Symbol:
ID: 5876813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1157978
End bp: 1159297
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 36%
IMG OID: 641541469
Product: extracellular solute-binding protein
Protein accession: YP_001662749
Protein GI: 167039764
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
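
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1157978
end_bp = 1159297

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1320 bp
print(protein_length, "aa")  # 439 aa
```

Under these assumptions the computed values agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields.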


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000327347
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAGAA AAATTTTATC CATTATGTTG ACGTTTGCAT TGGTTTTTTC GCTCATGGCA 
GGCTGTGGTA CAAAAAGCAG CGATAATGGA GAAAGTAATA GTACTGCCAC GGCAACAAAG
ACTGTAAAAA TTACTTTGCT AAATTCTAAG GGGGAAATTC AGGCTCAATT AGAAGATGCA
GCTAAAGCTT TTACAAAAGA AAATCCGAAT ATTACTGTAG AAGTCATTCC TGCAACAGCC
GGTCAGTCAC CTTTTGAAAA GGTTACCTCC ATGTATGCAT CTGGTAATGC ACCGACAATG
GCAATGTTAG ATCCAGGTGA TATAGCAAAA TTCAAAGATA AATTCTTAGA TTTAAGCAGT
GAAAAGTGGG TTTCAGATGC AATAGATGGT GCTTTAAATG CAGCTACAGT GGATGGAAAG
GTTATAGCGT TTCCGTTTGC TGTTGAAGGA TATGGACTTA TTTACAACAA GGCTGTACTG
GACAAGGCTT ATGGTGGAAA CTTTGATCCG AGTTCTATAA AGACGAGAGA TGCTTTAGAA
GAAGCATTTA AAAAAGTAGA AGCAACAGGT GCTAAAGCAC TAGAAATTTC TCCAATGGAT
TGGTCTTTAG GCGCACATTT CCTTTCAATA GCGTATGCGG ATCAATCTAA AGATCCTGCT
CAAGTAGCTC AATTTTTATC AGACTTAAAA GCGGGAAAAG TTGATTTAGC AAATAATAAA
GTTTTTAACG GTTTAATGGA TACTTTTGAC ATGATGAAAA AGTACAACAT AGATAAAAAT
GATCCATTAT CTCCGACTTA TGATAGAGGA CCAGAGCTTA TTGGTAAAGG TGAAGTTGGA
TTTTGGTTTA TGGGAAATTG GGCATGGCCA CAGATAAAAG AATTTGATAC TGCAAATGGA
CAATACGGCT TTATACCTGT ACCAATCAGC AATAACCCAG ATGACTATGG TAATTCAGGT
ATACCTGTAG GTGTAACAAA ATTTATCGGC ATAGATAAAA CACAAAATAG TGCTGAGCAG
CAAGATGCAG CTAAGAAATT TTTAGATTGG TTGGTATACA GCTCTACAGG TCAAGACATG
CTTGTGAACA AACTTAACAT TATACCTGCA TTTAAAAATA TAACTTTACA ACCGCAAGAT
CCCCTTGCTA AATCTATTTT GCAGTATGTT AAGAGTGGTA ATACTTTAGA GTTTATGACT
ACATTGCCAC CTGACCACTG GTCAAAGTTA GGAGCTTCAA TGCAAAAGTA TTTGGCAGGG
AAAATTGACA GAAAAGGCTT GATTGATGAA ATAGAAAATT ATTGGAAAAA TGTTCAATAA
 
Protein sequence
MKRKILSIML TFALVFSLMA GCGTKSSDNG ESNSTATATK TVKITLLNSK GEIQAQLEDA 
AKAFTKENPN ITVEVIPATA GQSPFEKVTS MYASGNAPTM AMLDPGDIAK FKDKFLDLSS
EKWVSDAIDG ALNAATVDGK VIAFPFAVEG YGLIYNKAVL DKAYGGNFDP SSIKTRDALE
EAFKKVEATG AKALEISPMD WSLGAHFLSI AYADQSKDPA QVAQFLSDLK AGKVDLANNK
VFNGLMDTFD MMKKYNIDKN DPLSPTYDRG PELIGKGEVG FWFMGNWAWP QIKEFDTANG
QYGFIPVPIS NNPDDYGNSG IPVGVTKFIG IDKTQNSAEQ QDAAKKFLDW LVYSSTGQDM
LVNKLNIIPA FKNITLQPQD PLAKSILQYV KSGNTLEFMT TLPPDHWSKL GASMQKYLAG
KIDRKGLIDE IENYWKNVQ
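
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as start codons). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base
# varies fastest). Table 11 shares these assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons from the first base until a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First display line of the gene sequence above.
first_line = "ATGAAGAGAA AAATTTTATC CATTATGTTG ACGTTTGCAT TGGTTTTTTC GCTCATGGCA"
print(translate(first_line))  # MKRKILSIMLTFALVFSLMA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields of the record.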