Gene Cagg_1620 details


Gene Information

Locus tag: Cagg_1620
Symbol:
ID: 7268921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1975766
End bp: 1977349
Gene Length: 1584 bp
Protein Length: 527 aa
Translation table: 11
GC content: 54%
IMG OID: 643566461
Product: proton-translocating NADH-quinone oxidoreductase, chain N
Protein accession: YP_002462957
Protein GI: 219848524
COG category: [C] Energy production and conversion
COG ID: [COG1007] NADH:ubiquinone oxidoreductase subunit 2 (chain N)
TIGRFAM ID: [TIGR01770] proton-translocating NADH-quinone oxidoreductase, chain N
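
The coordinates and lengths in the table above are mutually consistent, and a short sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1975766, 1977349)
print(length_bp)                  # 1584
print(protein_length(length_bp))  # 527
```

Both values match the Gene Length and Protein Length entries of this record.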


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.308242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0422608
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCAAC TGGCCGACAT CCCGCGTCTC TTGCCGGAGA TACTGTTGTT GGTACTAGCT 
CTGTTGGTGC TAGGATCTGA CATCCTTGAA CGGTGGGGAC GCACGCCGGA AGCGCAACAA
GAGCGGGTTA AGTCATCGGC GTCGCTGACG GCTATAGGGT TGGGTATGGT GTTTGTTGTC
GCTCTCTTGC AGAGCGGTTA CGTCTACCAA TTGCCGGAGA CAGCGCCGGT TAACTTCTTC
ACCAATCTGA TCCGTAATTT ACAAGGTGGT GGGCCACGTG ATGATGCGAT TGCCGGTGTC
TTTGTCGCCG ATCATCTGAC AATGGTGTCA CGCCTGCTCA TTATTGGTGC TGCTTTTCTG
ACGAGTTTGC TCTGCACCGA TGTGCGGCCA AATGCGCATC CGGGTGAATT TTATGCTCTG
ATTATCTTTG CTACGCTTGG TATGTGCTTG ATGGTCGGGG CAAATGAATT TCTGCTGGCT
TATCTGGCAA TCGAGCTGAC CTCTATCCCG TTGTATCTGT TGGCGGGTTA CTTCCACAAT
GATGCACGGT CGGCAGAGTC AGGTTTGAAA TACTTCTTGT TTGGGGCTGT TTCATCGGCA
ATCTTGTTGT ATGGGATGAG TCTGGCATTT GGTGCGGCGC TGAACGGCGT GAGTGGAGTG
ACCAATTTCA ATGATCTGAC CCGGTTTGAT CGGATTGGTG CTTTTACGGC TAGTGGTGGC
TCGATAACCC TGGCGATGCT CTTCATCGTA GCGGGGATGG GCTATAAGTT GGCGATCGTT
CCCTTCCATG GGTGGTCGCC TGATGTGTAC GAAGGCGCAC CAACGCCGAT TACGGCCTTT
ATCTCGACGG CGTCGAAGGC GGCAGGGTTT ATTCTGCTGT TCCGTCTGTT GACGAAGACG
TTCCCGGCAA TTGTTGGCGC GCCGGTGTTT GGAGATGAAG CCGGTGGTTG GACGGGGGTG
TTGGCGGTGC TGGCCTTGCT GACGGTTGTG ATCGGGAATT TGGCGGCATT GCCACAGACG
AACGCAAAGC GACTGCTGGC CTATTCGAGC ATTGCGCACG CCGGATTTGT TGTACTGGGA
TTGCTCGCGT GGGCGGCGGC GCAAACCTTC GACCGTGAGC AAGGGTTGGT GGCGTTGCTG
TATTACTTGA TCATCTATAG CCTGACGAAT TTGGGTGCGT TCGGTGCGTT AGCCTTGATC
GGTCACCAGA CCGGGGGTGA TGATTTTGAC CACCTGCGTG GTCTCTCGCG CCGTAACTTA
CCGCTGGCAC TGCTGTTTAC CGTCTGTATT CTCTCGCTGG CCGGTATTCC GCCGCTTGGT
GGTTTCTTCG CTAAGTTCTA CATCTTTATG GCGGGTTGGC AGAGTGGGGC GACGTGGCTG
GTGATTATTG CCGTGATTAC CACCATCATC AGTTTGTACT ATTATCTGCG TTTGCTGAAG
GTGATGTTTA TCGAGCCGGC AATTGATCCG ACACCGGTTA CAATGCCACG AGGTATTGCG
GCAGCATTAG GTATCGCTGT GGTGGGCGTG TTGGTGTTGG GTGTTTTCCC TAATGTGATC
TTGAGTGTCT TAGAACGGGT GTGA
 
Protein sequence
MFQLADIPRL LPEILLLVLA LLVLGSDILE RWGRTPEAQQ ERVKSSASLT AIGLGMVFVV 
ALLQSGYVYQ LPETAPVNFF TNLIRNLQGG GPRDDAIAGV FVADHLTMVS RLLIIGAAFL
TSLLCTDVRP NAHPGEFYAL IIFATLGMCL MVGANEFLLA YLAIELTSIP LYLLAGYFHN
DARSAESGLK YFLFGAVSSA ILLYGMSLAF GAALNGVSGV TNFNDLTRFD RIGAFTASGG
SITLAMLFIV AGMGYKLAIV PFHGWSPDVY EGAPTPITAF ISTASKAAGF ILLFRLLTKT
FPAIVGAPVF GDEAGGWTGV LAVLALLTVV IGNLAALPQT NAKRLLAYSS IAHAGFVVLG
LLAWAAAQTF DREQGLVALL YYLIIYSLTN LGAFGALALI GHQTGGDDFD HLRGLSRRNL
PLALLFTVCI LSLAGIPPLG GFFAKFYIFM AGWQSGATWL VIIAVITTII SLYYYLRLLK
VMFIEPAIDP TPVTMPRGIA AALGIAVVGV LVLGVFPNVI LSVLERV
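
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, how a GC fraction could be computed, and how the gene sequence relates to the protein sequence. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in its permitted start codons. The sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence above.
print(translate("ATGTTTCAAC TGGCCGACAT C"))  # MFQLADI
print(translate("GAACGGGTGTGA"))             # ERV
```

The two fragments reproduce the first seven and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content entry of the record.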