Gene Teth514_0394 details

Gene Information

Locus tag: Teth514_0394
Symbol:
ID: 5877602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 403081
End bp: 404523
Gene length: 1443 bp
Protein length: 480 aa
Translation table: 11
GC content: 35%
IMG OID: 641540730
Product: glycoside hydrolase family protein
Protein accession: YP_001662042
Protein GI: 167039057
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
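
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 403081
end_bp = 404523

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not code for an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed gene length (1443 bp) and protein length (480 aa).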


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0289899
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCATCAG TAAAAATTGG TATTATAGGT GCAGGAAGTG CTGTATTTTC TCTGAGGTTG 
GTGAGCGATT TATGCAAAAC TCCAGCGTTA TATGGAAGCT TAGTAACTCT CATGGATATC
GATGAAGAAA GATTAGAGGC CGTGTACATA CTTGCAAAAA GGTATGTGGA AGAAGTAGGA
GCAGAGTTAA AGTTTGAAAA AACTAAAAAT TTAGAAGATG CCATAATAGA TGCAGATTTC
GTAATAAATA CAGCAATGGT AGGAGGACAT ACTTATCTAG AAAAAGTCAG ACAGATTAGC
GAAAAATATG GCTATTATAG AGGAATAGAT GCACAGGAAT TTAATATGGT TTCAGATTAT
TACACTTTCT CAAATTACAA TCAGTTGAAA TATTTCGTAG AAATTGCCAA GAAAATTGAA
AAACTGTCTC CAAACGCCTG GTATTTACAA GCTGCTAATC CCGTATTTGA AGGCACAACT
CTTGTAACAA GGACCTCTTC GATAAAAGCA GTTGGATTCT GCCATGGACA TCTTGCATTA
AAAGAAGTAT TTGACACACT GGGACTTAAG CATAATAAAG TAGATTGGCA AGTAGCAGGA
GTAAATCACG GAATATGGCT TAATAGATTT ATATATGAAG GAAAAAGTGC TTACGAAAAG
CTCAACACCT GGATAGAGGA AAATTCTCAC AACTGGAAAC CTCTTCATCC ATTTAATGAC
CAGCTATCCT CAGCTGCCAT TGATATGTAC AAGTTTCACG GAGTTTTGCC TGTTGGCGAT
ACCGTAAGAA ATGCTTCTTG GCGGTATCAT AAAAACCTGG AAACAAAGAA AAAGTGGTAT
GGAGAACCTT GGTGTGGTGC AGACTCTGAA ATAGGTTGGA AATGGTACCA AGAAACATTA
GGAAAAATTA CAGACATCAC TAAAAAGATT GCAAAGTTCT TGATAGAAAA TCCAAAAGCA
AAGTTTAGCG ATATAAAAGA AATTTTCGGT CAAGAGGCAA AAGACAATGA ATTACTACAG
GAAATGGAAA AAATACTAGA CCCAGAGCAA AAAAGTGAAG AACAGCACAT TCCTTTCGTA
GAATCGATTG TAACCGGCAA AAAAGAAAGA TTTGTAGTAA ATATACCAAA TAGAAGAATA
ATTCCTGCAG TAGAAAATGA TGTTGTTGTA GAAGTACCTG CAATAGTAGA TAGCGAAGGA
ATACATCCAG AAAAAATAGA ACCCATGCTC CCAGAGAGAG TAATAAAGTA TTACCTAAAA
CCGAGAATTA TGAGAATGGA AATGGCAGTA GAAGCATTTT TAACAGGGGA CATAGACATA
ATAAAAGAAC TTCTGTACAG AGACCCTAGA ACTCAAAATG ATGGGCAGGT AGAAAAAGTT
TTAGAAGAAA TTCTATCCCT ACCAGAAAAT GAAGAGATGA AAAAACATTA TTTAAAAAAA
TAG
 
Protein sequence
MPSVKIGIIG AGSAVFSLRL VSDLCKTPAL YGSLVTLMDI DEERLEAVYI LAKRYVEEVG 
AELKFEKTKN LEDAIIDADF VINTAMVGGH TYLEKVRQIS EKYGYYRGID AQEFNMVSDY
YTFSNYNQLK YFVEIAKKIE KLSPNAWYLQ AANPVFEGTT LVTRTSSIKA VGFCHGHLAL
KEVFDTLGLK HNKVDWQVAG VNHGIWLNRF IYEGKSAYEK LNTWIEENSH NWKPLHPFND
QLSSAAIDMY KFHGVLPVGD TVRNASWRYH KNLETKKKWY GEPWCGADSE IGWKWYQETL
GKITDITKKI AKFLIENPKA KFSDIKEIFG QEAKDNELLQ EMEKILDPEQ KSEEQHIPFV
ESIVTGKKER FVVNIPNRRI IPAVENDVVV EVPAIVDSEG IHPEKIEPML PERVIKYYLK
PRIMRMEMAV EAFLTGDIDI IKELLYRDPR TQNDGQVEKV LEEILSLPEN EEMKKHYLKK
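
The gene and protein sequences above are printed in blocks of ten characters, so they need light cleaning before use. The sketch below, in plain Python with no external libraries, shows one way to strip the spacing, translate a coding sequence, and compute its GC fraction. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, not part of any database tool. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). Only the first 30 nucleotides of the gene sequence are used here as a demonstration.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence listed above.
fragment = "ATGCCATCAG TAAAAATTGG TATTATAGGT"
print(translate(fragment))    # MPSVKIGIIG
print(gc_fraction(fragment))
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same functions would allow the complete translation and the listed GC content to be checked in the same way.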