Gene Cagg_1622 details

Gene Information

Locus tag: Cagg_1622
Symbol:
ID: 7268923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1979603
End bp: 1981270
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 58%
IMG OID: 643566463
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_002462959
Protein GI: 219848526
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
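
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that convention) and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1979603
END_BP = 1981270
GENE_LENGTH_BP = 1668
PROTEIN_LENGTH_AA = 555


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1668, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 555, matches Protein Length
```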


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.878864
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0450329
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTGC CGGAGTATCT CGTTCCTGCC GCCGATGGTG TGCCGTGGTT GACGCTACTG 
GTGTTGTCGC CATTGGTTGG GATCGCGCTA ATCGGTTTGG CGTGGTTGAT GAAGCTCGAT
GAACGGACGG TCAAAGCGGG GGTACTGGCG TGGACGGGGG TGCCGTTGCT CTTGGCCGGC
TTGATCTGGG CACGGTTTGA TCCGCAAGCG GTAGCGAGTG GGCAAGGTGT CGTACAGTTG
GTTGAGCGAG TGCCGTGGAT ACAGGCGGTA CGGGTTGATT ACTTTCTCGG GGTTGACGGG
CTGAGTATGC CCTTGGTATT GCTGACGGCA GTGATGACGC CGGTGGCAGT CGTTGCGAGT
TGGCGCGTAA GTGAGCGGGT GCATGCTCAT TTGGCGTTAC TGCTATTGCT CGAGGCGGCG
ATGCTGGGCT ACTTCGTCGC GCTCGATTTC TTCTTCTTCT TCATCTTCTG GGAGTTTAGT
CTAGTACCGG CCTTTTTCTT AATCCAAAAC TGGGGGCGTG AACAGCGTCG CTATGCTGCC
TTTAAGTTCT TTGTGTATAC GATGGCCGGC TCGCTGGGCA TGTTGTTACT CTTCCAGGTG
ATATATCTGG CAATGCGGCA GGCCGGTTAT CCGACCTTCG ACCTGATCGC GCTCGGACGG
TTGGGTCAGG GCTTGCCGGT CGAGGGGGTA ACCGGTAACT TGCGAGATAT TCTCTTTGCC
TATCTCGACC AGCTTGGGGT AACGAATGTG CTTGGTCGTT ATCCACTGCT TTACAACAGC
ATTGCGATGT GGGCCATCTT TATCGCCTTC GCCATCAAGC TTGCCGTTTG GCCGTTCCAC
ACGTGGCTCC CCGATGCCTA TGCCGAAGGG CCGACTGCGG CCAGTATTCT ACTTTCGGCG
GTGATGAGCA AGATGGGAGC GTATGGTATG CTGCGGCTCC TGCTCCCCTT TACGCCCGAT
GCAGCCCAAT ACTTTGCTCC AGCGCTGGCT GCGTTGGCGG TAGTGGGCGT TGTAGCAGGT
GCCTTCGGTG CGTTGGGGCA GGTCGACGGC GACGTAAAGC GATTGATCGG CTATACGTCG
ATCAACCACA TGGGTTATGT GATGCTGGCG ATTGCCGGCG CTGCCGCAGC GGGTGAAGCG
GGGATCGATG CGCGCACGAG TGCGATCAAC GGTGCATTGG TTCAGATGGT AGCTCACGGT
CTCAGTACCG GTGCGCTGTT CTACCTTGCC GGCGCGCTGC ACGAGCGTAC CGGTCGTTGG
GAATTGAGTG GATTAGGTGG TTTGCGGACC GGTGCTCCGA CCTTTGCCGG TGTGATGGGG
ATTGCCCTCT TCGCCAATCT TGGCTTGCCC GGTTTGGCCG GTTTTGTCGG CGAGTTCTTC
ATCTTCCGTG GCGCATGGGC GACGTTGCCT TTCTTTACCG CCCTGGCCGT GGTAGGGTTG
GTTGTGACTG CACTTGCGCT GCTGTTGATG TTCCAGCGCA TTTTTCTTGG TCCGGCTGTT
GGGATGCCAC GCACCATTAC CGATCTGCGT CCGCAAGAGT TCTGGACGAT GGCGCCGATT
TTGGCCCTCT CGTTGGCAAT CGGGGTGTAT CCCGGCCCGC TGATGGCGTT GGGTAATGCC
GCAGCCACGC AGTTGGTGGC GATCTTTACG CGAGTACTGG CAGGATGA
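
The gene sequence is listed in blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the listing could be cleaned and its GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, six blocks of ten bases.
first_line = "ATGAATCTGC CGGAGTATCT CGTTCCTGCC GCCGATGGTG TGCCGTGGTT GACGCTACTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

If the record is internally consistent, running the same two functions over the full listing should give a length of 1668 and a GC fraction close to the 58% reported under GC content.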
 
Protein sequence
MNLPEYLVPA ADGVPWLTLL VLSPLVGIAL IGLAWLMKLD ERTVKAGVLA WTGVPLLLAG 
LIWARFDPQA VASGQGVVQL VERVPWIQAV RVDYFLGVDG LSMPLVLLTA VMTPVAVVAS
WRVSERVHAH LALLLLLEAA MLGYFVALDF FFFFIFWEFS LVPAFFLIQN WGREQRRYAA
FKFFVYTMAG SLGMLLLFQV IYLAMRQAGY PTFDLIALGR LGQGLPVEGV TGNLRDILFA
YLDQLGVTNV LGRYPLLYNS IAMWAIFIAF AIKLAVWPFH TWLPDAYAEG PTAASILLSA
VMSKMGAYGM LRLLLPFTPD AAQYFAPALA ALAVVGVVAG AFGALGQVDG DVKRLIGYTS
INHMGYVMLA IAGAAAAGEA GIDARTSAIN GALVQMVAHG LSTGALFYLA GALHERTGRW
ELSGLGGLRT GAPTFAGVMG IALFANLGLP GLAGFVGEFF IFRGAWATLP FFTALAVVGL
VVTALALLLM FQRIFLGPAV GMPRTITDLR PQEFWTMAPI LALSLAIGVY PGPLMALGNA
AATQLVAIFT RVLAG
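
The relationship between the two listings can be illustrated with a small translation sketch. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the tables differ in which codons are accepted as starts, which does not matter here because this gene begins with ATG). The function name `translate` and the stop-at-first-stop behaviour are choices made for this illustration only.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases in T, C, A, G order,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAATCTGC CGGAGTATCT CGTTCCTGCC"))  # MNLPEYLVPA
# The final 18 bases give the last five residues followed by the TGA stop.
print(translate("CGAGTACTGG CAGGATGA"))  # RVLAG
```

Both fragments agree with the corresponding ends of the protein sequence listed above.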