Gene Tbd_1149 details


Gene Information

Locus tag: Tbd_1149
Symbol: nuoH
ID: 3673221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiobacillus denitrificans ATCC 25259
Kingdom: Bacteria
Replicon accession: NC_007404
Strand:
Start bp: 1203946
End bp: 1204977
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 63%
IMG OID: 637709833
Product: NADH dehydrogenase I chain H
Protein accession: YP_314907
Protein GI: 74317167
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
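
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is only an illustration and rests on two assumptions that the record does not state explicitly: that the coordinates are 1-based and inclusive, and that the final codon is an untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1203946
end_bp = 1204977

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1032 bp) and Protein Length (343 aa) fields.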


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGCGG GTATCGTGGA GGCCGCGGGA ATGGATTGGG AGGCGTTCCA GACCGTCGCA 
TGGACGCTGG TCAAGATCAT GGCGCTGGTC GTGCCGCTGA TGCTCGGCGT CGCGTATCTG
ACCTATGCCG AGCGCAAGAT CATCGGCTGG ATGCAGGTCC GGATTGGCCC CAACCGCGTC
GGTTTCCAGG GTCTTCTGCA GCCGATCGCC GACGCCGTGA AGCTGCTGAT GAAGGAAATC
ATCATTCCTT CCGGCGCCAG CCGCGCGCTT TTCATCCTCG GCCCGATCCT CGCTATCGCG
CCGGCGCTCG CGGCTTGGGC GGTCATCCCG TTCAGCGACG GGCTCGTTCT CGCCGACATC
AACGCGGGAC TGCTCTACGT GATGGCGATC ACGTCGATGG GCGTCTATGG CGTGATCATC
GCGGGCTGGG CATCCAACTC GAAGTACGCA TTCCTCGGCG CGATGCGCTC GGCGGCGCAG
ATCGTGTCCT ACGAGATCGC GATGGGCTTC GCGCTCGTCG GCGTATTGAT GGCCTCGCAG
TCGCTCAATC TGAGCGCGAT CGTGCAGGGC CAGGCCGGCG GCATCCAGCA GTGGTATCTG
TGGCCATTGT TCCCGCTTTT CGTCGTCTAT CTCGTCGCCG GCGTCGCCGA GACCAACCGC
GCGCCGTTCG ACGTCGCGGA GGGCGAGTCC GAAATCGTCG CCGGCTTCCA CGTCGAATAC
TCGGGCATGG CTTTCGCCGT GTTCTTCCTT GCGGAATACG CCAACATGAT CCTGGTCGCC
GCGCTGACGA CGCTCATGTT CCTCGGCGGC TGGCTGTCGC CGGTCGCCTT CCTGCCGGAC
GGCATCGTCT GGTGGTTGCT GAAGACCGGC TTCGTGTTGT TCCTCTTCCT GTGGTTCCGC
GCGACCTTCC CGCGTTATCG CTACGACCAG ATCATGCGTC TCGGCTGGAA GGTCTTCATC
CCGATCACCA TTGTCTGGAT CGTCTTCGTC GGCGGCATGA TGCAGACGCC CTACGGTCAC
CTTTTCCATT GA
 
Protein sequence
MEAGIVEAAG MDWEAFQTVA WTLVKIMALV VPLMLGVAYL TYAERKIIGW MQVRIGPNRV 
GFQGLLQPIA DAVKLLMKEI IIPSGASRAL FILGPILAIA PALAAWAVIP FSDGLVLADI
NAGLLYVMAI TSMGVYGVII AGWASNSKYA FLGAMRSAAQ IVSYEIAMGF ALVGVLMASQ
SLNLSAIVQG QAGGIQQWYL WPLFPLFVVY LVAGVAETNR APFDVAEGES EIVAGFHVEY
SGMAFAVFFL AEYANMILVA ALTTLMFLGG WLSPVAFLPD GIVWWLLKTG FVLFLFLWFR
ATFPRYRYDQ IMRLGWKVFI PITIVWIVFV GGMMQTPYGH LFH
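
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the method used by the database: the `translate` helper and the `CODON_TABLE` name are hypothetical, and the sketch assumes that the sense-codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in which codons may act as starts). Only the first and last blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: sense-codon assignments for table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to format the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (first three 10-base blocks).
first_codons = translate("ATGGAGGCGG GTATCGTGGA GGCCGCGGGA")
# Last 12 bases of the gene sequence, which end in the stop codon TGA.
last_codons = translate("CTTTTCCATT GA")

print(first_codons)
print(last_codons)
```

The two results correspond to the first ten residues (MEAGIVEAAG) and the last three residues (LFH) of the protein sequence listed above.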