Gene Haur_4989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4989 
Symbol 
ID5736825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6325261 
End bp6326958 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content51% 
IMG OID641282156 
Productproton-translocating NADH-quinone oxidoreductase, chain M 
Protein accessionYP_001547747 
Protein GI159901500 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAACG AATTTATCAA ATTCAGTGAT TGGAGCATCA CCACGCTAAT TGCCCTTTCG 
CCATTGGTGG GGATGTTGCT GGCCTTGGTA TTCCCCAAGC CAGCCGAAAA CAGCCGCACG
ATTGCTTGGG GGGTGTTTGC GTGGAGCTTG GTGCCACTCG GACTCACGCT CTTCTTGTGG
CTTAGCGGTG GGTTTAATCC AGCCCTCGCT TCTGTCGCTG GCGATCAAGC CATGATCCAG
CAAGTGGATC GGGTGCGCTG GGTTCCATTT TTCAACGCTG ACTATTTTGT TGGCCTTGAT
GGCCTGAACT TCCCGCTGGT GTTTTTGACC ACAGCGTTAA CGCCAGTCTG TATCTTGGCA
GCCTTCCGCA TCAAACATCG CCAAAACGTC TATTTGGCCT TGATGTTGTT GTTGGAATCG
GCGATGTTGG GCTATTTCGT ATCGCTCAAC TTCTTGCTGT TGTTCCTGTT CTGGGAATTC
AGCTTGGTGC CAATGTTCTT TATTATCAAC AACTGGGGTG GTGAAAACCG CCGCTACGCT
GCCTTCAAGT TCTTCGTGTA TACGATGGCT GGCTCGGTGG CGATGTTGTT GATTTTCGAA
TTTATCTATT TGGCGACTGG TACCTTCGAT TTGGTGGTGC TCTCACGCTT GGGTCAGGGC
TTGCCCGTTG ATCCAGCCTT GCTTGCGCCA AAATTGGGTG CAGGCTACAC CAGCGGCGCA
ACCTTGCAAT CAATGTTGTT CAGCGCCGTC GAAGATATTG GCTTGACCAG CATTTTGGGT
ACAAGCAATG GCACCCCAGC AGCAATTGTC TTCTGGAGTA TCTTTGTGGC CTTTGCGGTG
AAATTGGCAG TTTGGCCGTT GCACACCTGG CAGCCCGACA CTTACGAAAA TGCCCCAACC
AGTGGCTCGA TGATTGTCTC AGCCGTGATG TCGAAGATGG GTGCGTATGG CATGATCCGC
ATTATGATTA TGCTCTTCCC CCAACAAACC AAATTCTTCG CACCAGCGTT AGCAATCTTG
GCCTTGGCAA GCATTTTGTT TGGTGCCTAC GCTGGTTTGG CCCAAATCAA CCTTAAGCGT
TTGATCGCCT ATGCTTCGAT TAACCACATG GGCTATGTTT TGCTTGGCTT GGCGGCAGTA
GCTTCGGCAG CGCCCGAAAG CCTTGGCGAC TTAGCCGTGA ATATCCGCGC CTCAGCAATG
AATGGGGTGC AAGCACAGAT GGTTGCCCAC GGTTTCAGCA CCGCCGCATT GTTCTTCCTC
GCGGGTGAAC TCTACGAACG GACTGGCACG TACCAGCTTG ATCAATTTGG CGGCTTGCGT
AAAGTTATGC CAATTTTTGC TGGGATTATG GGCGTGGCGA TGTTTGCCAA CCTTGGGTTA
CCTGGTTTGG CTGGCTTCGT CGGCGAATTC TTTATTTTCC GTGGCGCGTG GGGCACGCAG
CCAGTGATCA CCACAATTGC TGTGTTGGGC TTGATTGTGA CTGCCTTGGT GCTGATCCGA
ATGTATCAAA AGATCTTCTA CGGGCCAGTT AACCACAAGC TGACCAACCT GCCAGACATC
AAAGTTGGTG ATTGGGCCTT CAACGTAACC CTACCGTTGA TTATTGTACT GTTGGTGTTT
GGGATTTTCC CCAAGCCACT GATGGATTTA TCAAACTACG CAGCCACGGT GATGGCTCAG
GTGTTTACAA ACCTGTAA
 
Protein sequence
MLNEFIKFSD WSITTLIALS PLVGMLLALV FPKPAENSRT IAWGVFAWSL VPLGLTLFLW 
LSGGFNPALA SVAGDQAMIQ QVDRVRWVPF FNADYFVGLD GLNFPLVFLT TALTPVCILA
AFRIKHRQNV YLALMLLLES AMLGYFVSLN FLLLFLFWEF SLVPMFFIIN NWGGENRRYA
AFKFFVYTMA GSVAMLLIFE FIYLATGTFD LVVLSRLGQG LPVDPALLAP KLGAGYTSGA
TLQSMLFSAV EDIGLTSILG TSNGTPAAIV FWSIFVAFAV KLAVWPLHTW QPDTYENAPT
SGSMIVSAVM SKMGAYGMIR IMIMLFPQQT KFFAPALAIL ALASILFGAY AGLAQINLKR
LIAYASINHM GYVLLGLAAV ASAAPESLGD LAVNIRASAM NGVQAQMVAH GFSTAALFFL
AGELYERTGT YQLDQFGGLR KVMPIFAGIM GVAMFANLGL PGLAGFVGEF FIFRGAWGTQ
PVITTIAVLG LIVTALVLIR MYQKIFYGPV NHKLTNLPDI KVGDWAFNVT LPLIIVLLVF
GIFPKPLMDL SNYAATVMAQ VFTNL