Gene Tpen_0210 details


Gene Information

Locus tag: Tpen_0210
Symbol:
ID: 4602221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 188817
End bp: 190646
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 53%
IMG OID: 639772964
Product: glycoside hydrolase family protein
Protein accession: YP_919623
Protein GI: 119719128
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
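
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic, and a short check can make that relationship explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 188817
END_BP = 190646
GENE_LENGTH_BP = 1830
PROTEIN_LENGTH_AA = 609


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1830
print(expected_protein_length(GENE_LENGTH_BP))  # 609
```

Under these assumptions both computed values agree with the Gene Length and Protein Length listed in the record.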


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.044129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGAAAG TATCACCTGG GAAAATCGGA GCTGTAATCT TACAGGGAAG CAAGGTATCG 
CTGAACTTTA CCGTGGAGAA CGATGAAGGC ACTCCCGTAA AGGGCACTCC TTTCTTCAAC
CTAAAGGCTC AGGGGAAGCA ACAGATATAC GTTGTGTTCG AGCAAGCGGT TATTCAGCCT
GGAGAAAAAG CAACGTTCAA CTTTGAAGTT AAATTCGACG TACCGGTTGG AGAGGCTGTC
GCGTACTTCT GCTTCGAGAC GGACAAGGCT GTATGTGAAG AGAGGCAGGT CTACATCGCG
GGGGAGGGTG AAGCCATCTA CGTAGCTTTC GTGTGGCATC ACCACCAGGC CCCCCAGTTT
TACCCCGACG GGAGCCTAAA GGACGAGTGG GCATTTATAC ACGTCGCTAA GGGCGACTTC
TACGCGTATT CGGGGGGACC CTACAAGGTA CACATGGAGA CGCACAAACG TATACCAGGC
TTTATCGATG TAGACCACTT CTCCCCGTCG CTCCTGGAGC AGTGGTTGCT TTTTCTATCG
GGTAAGCTAA GGTCCTCGAC AGCAACCAAG GAGGACGTAG AGGGGTTGCT GAGCTTTCTT
AGGGAGAAGA TCCGGGAAGG CATCGTGGAG CCCCTGGGAA GCGTCTACGC CCACACGGTT
CTCGGCCTCG TTCTGAAGAA GGCTAAGCAG AGGGGCCTCG ACGAGATAGC GAAGAAGTTG
ATCGAGTGGG AGCTCCGGGA GGGCTTGAAG ATTGTGGAGG AAGCCCTGGG CCGCCGGCCG
AGCGGGCTTT GGACACCCGA AATGTTCTGG CACATGGACC TCGTCGACCT CTACGGCTCG
CTCGGAGTAA GGTACACCGT CCTGTGCGAG CAACACTTTA CGCGCGCTGG TGGGGACAAG
GAGAACATTT ACGAGCCATA CGTCGTCGAG GACCCTATTT CCGGGAGATC CGTCGTCGTC
TTCTTCAGGG ACCTAAAGCT CAGCAACTGG ATTAGCTTCA AGGTTGACTT CAAGAATCCA
GAGGAAGCGG ACAACGAAGC TAGAAGGTTT GTTATAGAGC TGGCTAAGAG GAGGGAGGTC
GCGCCAGGCG GGATTGTAAC CATAGCTCTC GACGGCGAGA ACTGGATGAT CATGCCGGCG
TACAGGAAGT ACGCGCCGTA CTTCCTGGAA AAAGTGCTCG AGTACATCTT GGAGAGCCAG
GTAATAAGGC TCACGACTTT ATCGAAGTAC CTCGACCAGA ACCCGCCGAA ACGCGTCCTT
GACTACATAC CGTCGGGCTC CTGGGTAGAG CTCTCGGATA AGCAGTGGAC CGGGGGCGCA
AAGGACGAGC TCTGGAACGA GGCAATGGAA ACTTTAGTCT ACGTTGAATC AGCGTATAGA
TTACTGGAAC CGGAGGCTGA GCGCCTGCTC GCAGATCCCG ACTCACCACT CTACAGGCTG
TTCAAAGCCA TCGCTATAGC CATAGACAGC GACTTCTACT GGTACGGAGA GTTGGAAAGA
GAACGGGAGT TTATAAAGGA GTGGTTAGCA GAGGCGCGGA AGATAGCTGG AGAAATACTC
GGAAACCTTA AGGCAAGGGA AGTCGGGAGG ACGAACAACC ATGCTTTCAT CGAAATTGAA
AACAGAAATC TCTTCTCGGC GAAGGTGAGG ATAGTTTCAG AGGCTAAGAG CCGAGTCGAT
GAAACGAGTA TAATTATCCC AGCGTCCTCG AAGGTAACCC TTCCCGTCTA CGTCGGTGAC
TCAAATACCC TTGTGAGAGT GGTTTCCGGT AAAGTAACTC TTCAAGTTGT TGGCGGCGTT
GACGGTTCTC CGGCGCCCCT CTCTCTTTAG
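
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. A minimal sketch of that cleanup, together with a GC percentage that could be compared against the GC content field of the record, is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence, as a small example.
print(gc_percent("ATGGTGAAAG"))  # 40.0
```

Passing the full gene sequence text to `gc_percent` gives the value for the whole CDS; the record reports it rounded to a whole percent.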
 
Protein sequence
MVKVSPGKIG AVILQGSKVS LNFTVENDEG TPVKGTPFFN LKAQGKQQIY VVFEQAVIQP 
GEKATFNFEV KFDVPVGEAV AYFCFETDKA VCEERQVYIA GEGEAIYVAF VWHHHQAPQF
YPDGSLKDEW AFIHVAKGDF YAYSGGPYKV HMETHKRIPG FIDVDHFSPS LLEQWLLFLS
GKLRSSTATK EDVEGLLSFL REKIREGIVE PLGSVYAHTV LGLVLKKAKQ RGLDEIAKKL
IEWELREGLK IVEEALGRRP SGLWTPEMFW HMDLVDLYGS LGVRYTVLCE QHFTRAGGDK
ENIYEPYVVE DPISGRSVVV FFRDLKLSNW ISFKVDFKNP EEADNEARRF VIELAKRREV
APGGIVTIAL DGENWMIMPA YRKYAPYFLE KVLEYILESQ VIRLTTLSKY LDQNPPKRVL
DYIPSGSWVE LSDKQWTGGA KDELWNEAME TLVYVESAYR LLEPEAERLL ADPDSPLYRL
FKAIAIAIDS DFYWYGELER EREFIKEWLA EARKIAGEIL GNLKAREVGR TNNHAFIEIE
NRNLFSAKVR IVSEAKSRVD ETSIIIPASS KVTLPVYVGD SNTLVRVVSG KVTLQVVGGV
DGSPAPLSL
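
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact 64-letter amino acid string that NCBI uses for translation table 11 (bases ordered T, C, A, G). It is an illustration rather than the tool that generated this record: it does not handle the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG, and the function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1 and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGTGAAAG TATCACCTGG GAAAATCGGA GCTGTAATCT TACAGGGAAG CAAGGTATCG"
print(translate(first_line))  # MVKVSPGKIGAVILQGSKVS
```

The output matches the first twenty residues of the protein sequence listed above.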