Gene Tneu_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0542 
Symbol 
ID6165759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp494837 
End bp495925 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content65% 
IMG OID641667695 
Productcellulase 
Protein accessionYP_001793931 
Protein GI171185012 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.192278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00117454 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGAGGACT TCGTCGCTCT CCTAAAGACC CTATCCGAGG CGAGAGGCCC CTCCGGCTTC 
GAGGACGAGG TGAGGGAGAT CGTGATAAAG GAGATGGAGC CGTACGTAGA CGAGGTGTTG
GTGGACAGGT GGGGGAACGT AATAGGGGTC AAGAGGGGGG CCTCCGAGGT CCGGGCCATG
GTGGCGGCCC ACATGGACGA GATCGGGCTG GTTGTAGACC ACGTCGAGAA GGAGGGCTTT
CTAAGGTTTA GGCCGATCGG CGGCTGGAAC GAGGTGACGC TGCTCGGCCA GCGGGTGTGG
GTGAGGACTC AAGATGGGAG GTGGGTCAGG GGGGTCGTAG GCGTTACGCC GCCGCATGTG
ACCCCCTCCG GCCACGAGAG GGAGGCCCCG GAGATGAAAG ACCTCTACAT AGACGTGGGG
GCTAGAAGCA GGGAGGAGGC CGAGAAGATG GGCATCTCCG TCGGCTCCGT GGCCGTCCTC
GAGAGGGAGC TGGCCGTCTT AAACGGGAGG GTTGCGACGG GCAAGGCCTT CGACGACAGG
GTGGGCCTCG CCGTTATGTT GTACACCCTG CGGCAACTTG GCGACCTCCC CGTGACCCTA
TACGCCGTCG CCACGGTGCA GGAGGAGGTG GGCCTCCGGG GGGCCCAGAT AGCGGCGGAT
CGGATAGCCC CCCACTACGC GGTGGCCCTA GACACCACCA TAGCCGCCGA CGTGCCGGGT
GTAGGCGAGA GGCTACACGT GACTAAGCTG GGCGCGGGGC CCGCCATAAA GGTAATCGAC
GGCGGCCGCG GCGGCCTCTT CATAGCGCAC CCCGGGCTGA GGGACCACAT CGTGAAAATC
GCCAGGGAGG CCGGCATCCC CCACCAGCTT GAGGTGCTAT ACGGCGGCAC CACAGACGCC
ATGGCCATAG CCTTTAGGCG GGAGGGCGTG CCCGCCGCCG CCATCTCCAT ACCCACGCGC
TACGTCCACT CGCCGGTGGA GCTGGTGGAT CTGTCAGACG CGTTGAACGC GTCGCGGCTA
CTCAAGCAGG TGCTTGAGAA AACGACGCCG GCGGCGGTGG AGAAGTTCCT GGAGAGGAGG
GTGAAGTGA
 
Protein sequence
MEDFVALLKT LSEARGPSGF EDEVREIVIK EMEPYVDEVL VDRWGNVIGV KRGASEVRAM 
VAAHMDEIGL VVDHVEKEGF LRFRPIGGWN EVTLLGQRVW VRTQDGRWVR GVVGVTPPHV
TPSGHEREAP EMKDLYIDVG ARSREEAEKM GISVGSVAVL ERELAVLNGR VATGKAFDDR
VGLAVMLYTL RQLGDLPVTL YAVATVQEEV GLRGAQIAAD RIAPHYAVAL DTTIAADVPG
VGERLHVTKL GAGPAIKVID GGRGGLFIAH PGLRDHIVKI AREAGIPHQL EVLYGGTTDA
MAIAFRREGV PAAAISIPTR YVHSPVELVD LSDALNASRL LKQVLEKTTP AAVEKFLERR
VK