Gene BURPS668_1297 details


Gene Information

Locus tag: BURPS668_1297
Symbol: nuoH
ID: 4881817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1267706
End bp: 1268770
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 63%
IMG OID: 640127225
Product: NADH dehydrogenase subunit H
Protein accession: YP_001058345
Protein GI: 126441743
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
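
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(1267706, 1268770)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```

Both values agree with the Gene Length and Protein Length fields of the record.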


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTGT TCGATACGAT CAACTCGGGC GGAGCCCAGC TTCTCGGCGT CGCATGGCCG 
ACGGTGTGGG CGCTCGTGCG CATCCTCGTC GTCGCCGTCG TGATCCTGCT GTGCGTCGCG
TACCTGATTC TGTGGGAGCG CAAGCTGATC GGCTGGATGC ACGTGCGTCT CGGTCCGAAC
CGCGTCGGCC CGGCGGGCCT GCTGCAGCCG ATCGCCGACG TGCTGAAGCT GCTGCTCAAG
GAAGTGATTC GTCCGACGGC CGCGAGCCGC TGGCTGTATC TGGTCGCGCC CGTGATGACG
GTGGTGCCGG CGTTCGCGGT GTGGGCGGTG ATCCCGTTCC AGGCGGGCGC GGTGCTCGCG
AACATCAACG CCGGCCTGCT GTACGCGATG GCGATTTCGT CGATCGGCGT CTACGCGGTG
ATTCTCGCCG GCTGGGCGTC GAACTCGAAG TACGCGTTTC TCGGCGCGAT GCGCGCGGCC
GCGCAGATGG TGTCGTATGA AATCTCGATG GGCTTCGCGC TCGTGCTCGT GCTGATGACG
GCGGGCAGCC TGAACCTGTC GGAGATCGTC GGCTCGCAGC AGCACGGCTT CTTCGCGGGC
CACGGCGTCA ATTTCCTGTC GTGGAACTGG CTGCCGCTGC TGCCCGTGTT CGTCATCTAC
TTCATCTCGG GCATCGCCGA AACGAACCGC CACCCGTTCG ACGTGGTGGA AGGGGAATCG
GAAATCGTCG CGGGTCACAT GATCGACTAC TCGGGGATGG CGTTCGCGCT GTTCTTCCTC
GCCGAGTACA TCAACATGAT CGTGATCTCG GCGCTCGCGG CGACGCTGTT CCTCGGCGGC
TGGGACGCGC CGTTCGAATT CCTGTCGTTC ATTCCGGGCA TCTTCTGGCT GGTGCTGAAA
ATCTTCGCGC TGCTGTCGGT GTTCATTTGG GCCCGTGCGA CGTTCCCGCG TTACCGCTAC
GACCAGATCA TGCGCCTCGG CTGGAAGGTG TTCCTGCCCG TGTGCGTGTT CTGGGTGATC
GTGGTCGGTT TCTGGATGAT GTCGCCGCTG AATATCTGGA AATAA
 
Protein sequence
MSLFDTINSG GAQLLGVAWP TVWALVRILV VAVVILLCVA YLILWERKLI GWMHVRLGPN 
RVGPAGLLQP IADVLKLLLK EVIRPTAASR WLYLVAPVMT VVPAFAVWAV IPFQAGAVLA
NINAGLLYAM AISSIGVYAV ILAGWASNSK YAFLGAMRAA AQMVSYEISM GFALVLVLMT
AGSLNLSEIV GSQQHGFFAG HGVNFLSWNW LPLLPVFVIY FISGIAETNR HPFDVVEGES
EIVAGHMIDY SGMAFALFFL AEYINMIVIS ALAATLFLGG WDAPFEFLSF IPGIFWLVLK
IFALLSVFIW ARATFPRYRY DQIMRLGWKV FLPVCVFWVI VVGFWMMSPL NIWK
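
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch rather than the method the database itself uses: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in the start codons they allow). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Sequences are expected in the space-separated blocks of ten shown in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGCTTGT TCGATACGAT CAACTCGGGC"))  # MSLFDTINSG
```

Translating the first thirty bases gives MSLFDTINSG, the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.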