Gene Tpen_1672 details


Gene Information

Locus tag: Tpen_1672
Symbol:
ID: 4600924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1618086
End bp: 1620008
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 45%
IMG OID: 639774445
Product: glycoside hydrolase family 42 protein
Protein accession: YP_921070
Protein GI: 119720575
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3934] Endo-beta-mannanase
TIGRFAM ID:
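
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the terminal stop codon; the record does not state either convention, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1618086
END_BP = 1620008
GENE_LENGTH_BP = 1923
PROTEIN_LENGTH_AA = 640


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 1620008 - 1618086 + 1 = 1923 bp, and 1923 / 3 = 641 codons, one of which is the stop codon, leaving 640 residues.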


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.590042
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGACGAAG CGTTCAAGTT CTTACTAGGA GTTAACTACT GGCCTAGACT ATACAACGTA 
AAAATGTGGA AAGAATGGGA CGAAGAGAGC CTAAAGAAAG ACATAGAAAA AATGAAAGAA
CTCGGCGTTA GAGTAGTAAG GATATTTTTA AGGGATATAG ATTTTGCGGA TGAGAGAGGA
ATTCCAATTG AAGAGAGTCT GCAGAAGCTA CAAAGATTCC TCGATTTGCT TCACGAGAAA
AACCTCCAGG CATTCGTAAC GTTACTCGTA GGACACATGA GCGGAAAAAA CTTCCCAATA
CCGTGGACCA GCTTCGACAG CCTTTACACC CCCTCTTCCG TGGAGAAGAC CGCTACTTTT
GCGAGAAAAA TAGCAGAAAG GCTGGCATCA CACCCTGCAC TTGCTGGCTG GATACTAAGT
AACGAGCTCA GCTTGGTAAA GAGAGCTACG ACAAGAGAAG ATGCCCTCAG GCTACTCGAA
GCATTTACAA AAACTATGAA ATCCGTAGAC CCAAATCACA TCGTATCCAG CGGAGACATA
CCAGACAGCT TTATGCAAGA AACCCCTAAT GTACGTCACC TCGTCGACTA CGTTGGGCCA
CACTTGTACC TATACGACAC CGATCTCGTA CGGCTTGGAT ACTTCTATGG GGCAATGCTT
GAACTCTTCT CCAACGCAGG AGACCTGCCA GTCATACTTG AAGAGTTCGG CTTTAGTACA
CTACAATTCA GCGAGGAAAG CCACGCGCGA TTTGTGGAGG AAATTCTTTA CACATCTCTA
GCTCACGAAG CTTCTGGAGC GTTTATTTGG TGTTTCTCAG ACTTCACAGA AGAGAGCGGT
GAGCCATACG ATTGGAGACC TTTAGAGCTC GGCTTTGGTT TACTGAAGAA AGACGGTAGC
GAAAAACTAG CAGCAGACTC TTACAGGAAC TTTTCTCATG TGGTCGAAAG AATAGAAAAG
CTCGGACTTC ACTCTAAGTA CAAACGTTTA TCAAGCACTT TTGTAGTTTA TCCATTTTAC
TTATTCAGAG ACTACGAGTT CATATGGTAC AAAGAGTCAC TAGGCTTTTG GGAATCCATA
AAACCGCACT TGATGAGCTA CTCGCTTCTA TCCGCCTCTA GCGTTCCTTC TCGAATGGTT
TACGAGCTAG ACCTGAAAAA GATTTTAAAA AGCGCTAAGC TAGTAGTCCT TCCTTCTGTA
GTAGCTACAC TTGCTTCTAC GTGGCGCAAC CTTCTAGAAT ATGTAGAGCT CGGCGGAACC
CTCTATTCAT CCGTTATCAG GGGAGCCGGT GCTTTCAAAG CCCTCCACGA TGCGCCGACA
CACCTTTGGA ACGAGCTGTT TGGCGTGGAA AACGTTCTAG AGGCAGGCTC CATGGGACGC
AAGATCTTCG GAGTCGTTAA ATTGAAATTC GTCAGGAAAT TTGGCAACCT CAGTGAGGGG
GACGAACTAT TACTAAAAGT ACCAGAAAGC ATCTATACTT TCAAGGCGCA AAGCACGGAC
TCGGACGTAA TAGCCTTGGA TGACGAAGGA GAACCGGTAA TCTTCTTCTC TCGAAGGGGG
CGTGGCAAGA CCATTCTATC GCTGATACCT ATAGAGGTGA TATTACAGGC ACAGGAAAAC
GCTCAATGGC ACGAAGGAAC AATATTTTAT GAACAACTTG CCTTTGTTTC AGAGGTAGAA
AGACGTTATG CTTCAAAGGA TCCTAGAGTC GAGCTACAAG TTTACACGGG AGAAAAGGAC
GATCTTTTAA TAGTGATTAA TCACAGCAAT GAAAATGTGG AGACAAGCAT TACGAGTGCT
ACAAGAATCG TAGAAGCACA AGTAATTGGC GGTAAAGCAA GGCTATTACC AGAGTCGAAG
AGAGAGATGA GAGCTGTATT TCCCCCAAAG TCGGGCTCCA TTATTCGTGT CGTTAAAACC
TAG
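
A GC content figure such as the one listed in the gene information can be recomputed from the gene sequence. The sketch below is a minimal illustration; the function name `gc_percent` is hypothetical, and how the source database rounds the value is not stated, so the result may differ slightly from the listed percentage.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used above) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# Usage: paste the full gene sequence block into a string and call
# gc_percent(gene_sequence); the grouped layout can be passed as-is.
```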
 
Protein sequence
MDEAFKFLLG VNYWPRLYNV KMWKEWDEES LKKDIEKMKE LGVRVVRIFL RDIDFADERG 
IPIEESLQKL QRFLDLLHEK NLQAFVTLLV GHMSGKNFPI PWTSFDSLYT PSSVEKTATF
ARKIAERLAS HPALAGWILS NELSLVKRAT TREDALRLLE AFTKTMKSVD PNHIVSSGDI
PDSFMQETPN VRHLVDYVGP HLYLYDTDLV RLGYFYGAML ELFSNAGDLP VILEEFGFST
LQFSEESHAR FVEEILYTSL AHEASGAFIW CFSDFTEESG EPYDWRPLEL GFGLLKKDGS
EKLAADSYRN FSHVVERIEK LGLHSKYKRL SSTFVVYPFY LFRDYEFIWY KESLGFWESI
KPHLMSYSLL SASSVPSRMV YELDLKKILK SAKLVVLPSV VATLASTWRN LLEYVELGGT
LYSSVIRGAG AFKALHDAPT HLWNELFGVE NVLEAGSMGR KIFGVVKLKF VRKFGNLSEG
DELLLKVPES IYTFKAQSTD SDVIALDDEG EPVIFFSRRG RGKTILSLIP IEVILQAQEN
AQWHEGTIFY EQLAFVSEVE RRYASKDPRV ELQVYTGEKD DLLIVINHSN ENVETSITSA
TRIVEAQVIG GKARLLPESK REMRAVFPPK SGSIIRVVKT
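
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is a minimal, dependency-free illustration and not the database's own pipeline: it builds the codon table by hand, and it assumes the first codon is an initiator that is read as methionine when it is one of the table 11 start codons (here the gene begins with TTG). The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order. Table 11 assigns
# the same amino acids as the standard code; it differs in which codons
# may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(cleaned), 3):
        codon = cleaned[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An initiator codon is read as methionine regardless of its
        # usual meaning (TTG normally encodes leucine).
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("TTGGACGAAG CGTTCAAGTT CTTACTAGGA"))  # MDEAFKFLLG
```

The printed residues match the start of the protein sequence above, and the final codons GTT AAA ACC TAG give the terminal residues VKT followed by the stop.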