Gene Tneu_0421 details


Gene Information

Locus tag: Tneu_0421
Symbol:
ID: 6166206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 379540
End bp: 380949
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 63%
IMG OID: 641667579
Product: aldehyde dehydrogenase
Protein accession: YP_001793815
Protein GI: 171184896
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
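
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 379540
end_bp = 380949
gene_length_bp = 1410
protein_length_aa = 469


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, both comparisons agree with the table: 380949 - 379540 + 1 = 1410 bp, and 1410 / 3 - 1 = 469 aa.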


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.96029
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0394656
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAACC TAGTGGTGGT TAATCCAGCC ACCGAGGAGG CAATCGCCGA GCTCCCACAA 
GCCACGAGAG AAGACGTGAG GAGGGCCATA GACGCCGCGT GGGACGCCTT CGCCAGCTGG
TCGGCTCTCC CCCTGAGGAA GAGGACCCGC GTCTTGCTGA AGACCGCCGA GCTTGCCGAG
GCCGCCAGGG AGGACCTCCT CAAGACGCTG GTGGCGGAGT CAGGAAAGCC CATCAGAGAT
GCGGAGGCGG AGATCACGAG GGCGATAGAC ATCTTCCGCT CCAGCGCGGA GGAGGCCAAA
CTGATCCTGG AGGGGTCCGC CCCCAGGGTA GACGCCTACG AGTACCCAAT CGGCAACGAA
AACAGGCTCG TGGTGGCCGT GAGGGAGCCC GTGGGCGTCG TCGGGGGGGC CCTCAGCTAC
AACAACCCCG CCTCCACCTT CGCCCACAAG GTGGCCCCCG TCATCGCGGC GGGGAACACA
GTCGTCGTGA AGCCCTCCTC CTACACCCCC CTCACCGCCC TCAAGTTCCT GGAGATTATG
AAGAGGGCTG GGGTGCCCGA GGGGGTGGTA AACGTGGTTG TGGGCAGCGG GGAGGAGGTC
TTCGACGAGC TTATCCAGAG CGACAAGGTC GCTGGGATAA ACTTCACCGG TAGCACAGCG
GTAGGGCTAC AGGTGGCGGC TAAGGCCGCC TCCAGGGGGA AGAAGTTCAT GATAGCGCCA
GGGGGCTCCG ACCCGGCCGT GGTGTTTAAA GACGCCGATT TAGACGCGGC GGCTAGGATC
GTCGCCAGAG CCCGGTACGA AAACGCGGGC CAGAACTGCA ACGCCACCAA GAGGGTTTTC
GTGGAGCGGG AGGTCTACCC CAGGTTCGTG GAGCTCCTCC TCGGCTATGT GAAAGCCATA
AGGGTGGGCG ACCCCATGGA CTACAGCACG GACATGGGGC CCCTCATCTC CGAGAAGATA
GTGAAGGCCA TGGACGGCGT AGTGAAAGAC GCCCTTGAGA AAGGCGCTAA GCTGGCGGCG
GGGGGCAGGA GAATGAACAG GAGGGGCTAC TTCTACGAGC CCACCGTCCT CCTCTTCGAC
GGCGACGCCG AGGCTAAGGC GCTTAGGGAG GAGGTCTTCG GGCCGGTTCT GCCCGTGGTG
CCCTTCGAGG GGGAGGAGGA GGCCGTCCGC CTCGCCAACG CCACCCAGTA CGGCCTACAG
TCGGCCGTCT TCACCTCGGA CTACAGGAAG GCGCTTAGGG TGGCGAGAGG CATAAAGGCG
GGGGCGGTCA TGATAAACGA GAGCACCAGG GTGAGGTTCG ATGCCCTTCC CTACGGCGGC
GTTAAGATGT CGGGCTTCGG CTGGAGAGAG GGCGTGAGGT CGACCATGAT ATACTACACA
GAGCCCAAGT TCCTCGTCTT CGGGCTTTGA
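
The GC content reported for this gene is the fraction of G and C bases in the sequence. A minimal sketch of that calculation follows; it strips the spaces and line breaks used in the listing before counting. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the listing is used here as sample input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(text):
    """Remove the whitespace used to format the listing and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, as formatted in the record.
first_row = "ATGACAAACC TAGTGGTGGT TAATCCAGCC ACCGAGGAGG CAATCGCCGA GCTCCCACAA"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full listing through `clean_sequence` and then `gc_fraction` would give the value to compare against the GC content field.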
 
Protein sequence
MTNLVVVNPA TEEAIAELPQ ATREDVRRAI DAAWDAFASW SALPLRKRTR VLLKTAELAE 
AAREDLLKTL VAESGKPIRD AEAEITRAID IFRSSAEEAK LILEGSAPRV DAYEYPIGNE
NRLVVAVREP VGVVGGALSY NNPASTFAHK VAPVIAAGNT VVVKPSSYTP LTALKFLEIM
KRAGVPEGVV NVVVGSGEEV FDELIQSDKV AGINFTGSTA VGLQVAAKAA SRGKKFMIAP
GGSDPAVVFK DADLDAAARI VARARYENAG QNCNATKRVF VEREVYPRFV ELLLGYVKAI
RVGDPMDYST DMGPLISEKI VKAMDGVVKD ALEKGAKLAA GGRRMNRRGY FYEPTVLLFD
GDAEAKALRE EVFGPVLPVV PFEGEEEAVR LANATQYGLQ SAVFTSDYRK ALRVARGIKA
GAVMINESTR VRFDALPYGG VKMSGFGWRE GVRSTMIYYT EPKFLVFGL
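
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the standard amino acid assignments, which, as far as I know, translation table 11 shares with the standard code (the two differ in which start codons are permitted). The `translate` helper is illustrative and does not handle alternative start codons; this gene begins with ATG, so that case does not arise here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; third base varies fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACAAACC TAGTGGTGGT TAATCCAGCC"))  # MTNLVVVNPA
```

The output matches the first ten residues of the protein sequence, and translating the final twelve bases (TTCGGGCTTTGA) gives FGL followed by the stop codon, which matches its last three residues.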