Gene Nther_0313 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNther_0313 
Symbol 
ID6316146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatranaerobius thermophilus JW/NM-WN-LF 
KingdomBacteria 
Replicon accessionNC_010718 
Strand
Start bp329673 
End bp330701 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content39% 
IMG OID642642699 
ProductCellulase 
Protein accessionYP_001916499 
Protein GI188584954 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTAT TGAAAAAACT CACTCAAACA CCTGGAATAC CGGGAAGAGA GGAGCCTATT 
GCGGAATTAA TCAAAGAAGA AATGAACCAA ATTTGCGATG AAGTATGGGT AGATCCCTTG
GGAAGTGTAA TAGGACTTAA AAAAGGTAAT GGAAACAAAA AGGTAATGGT TGCTGGTCAC
ATGGATGAGA TAGGTTTTAT AGTAAAGCAT ATCGATAAGA ATGGATTTAT TCGCTTACAG
CCTGTAGGTG GCTTCGATCC CCGAAGTTTA ATGGCTCAAA GGGTGATTGT TCATGGAAAG
GAAGACTTGA TTGGCAATTT AGCCCCAGCT ACTAAGCCAA TTCATGTCTT GAGTCCTGAA
GAGAAGAAAA AACAACTTCA AGTAAAGGAT TATTTTGTTG ATTTGGGTCT TTCTGGTGAA
AAAGTCAAGG AATTGGTAGA AATCGGTGAC CCTGTCACCC TAAAACAAGA TTTTGAAGAA
ATCGGGAACA TGTATAGTAG TAAATCCCTT GATGACAGAG TGGGAGTATA TGTCATGTTA
GAAGCTGCAA AACAGCTTAA GGACCATGAT GCAGATATTT ATCTTGTAGC TACCTCTCAG
GAGGAAGTGG GTATTAGAGG AGCCATGACA TCTTCTTATG GCATCGAGCC TGATGTAGGC
ATTGCCCTTG ATGTGACTAT AGCGGCAGAT ACTCCAGGAA GCGAGGAATC AGAACAGGTT
ACCAAATTAG GTGAAGGTGC AGCTATTAAA ATTATGGACT CTGCTAGCAT AACTAATAGA
AAAGTACTTC AGACATTAAA AGACCTAGCC AATGAAAAAG ATATTAATCA TCAAATGGAA
ATACTACCTA AAGGAGGAAC CGATGCTGGT TCAATCCAGA GAAGCAAATC TGGAATTCCT
GTGGGGACAA TATCTATACC ATGCAGGTAT GTACATACAG TCAATGAAAT GATCCATAAA
GAGGATTTAG ATGCGTCAGT AAACTTACTA TCTGCTTTCC TTGCTGAAGC AAATTTTAAC
GAATTTTAA
 
Protein sequence
MELLKKLTQT PGIPGREEPI AELIKEEMNQ ICDEVWVDPL GSVIGLKKGN GNKKVMVAGH 
MDEIGFIVKH IDKNGFIRLQ PVGGFDPRSL MAQRVIVHGK EDLIGNLAPA TKPIHVLSPE
EKKKQLQVKD YFVDLGLSGE KVKELVEIGD PVTLKQDFEE IGNMYSSKSL DDRVGVYVML
EAAKQLKDHD ADIYLVATSQ EEVGIRGAMT SSYGIEPDVG IALDVTIAAD TPGSEESEQV
TKLGEGAAIK IMDSASITNR KVLQTLKDLA NEKDINHQME ILPKGGTDAG SIQRSKSGIP
VGTISIPCRY VHTVNEMIHK EDLDASVNLL SAFLAEANFN EF