Gene Cagg_1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1037 
Symbol 
ID7268409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1284910 
End bp1286487 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content53% 
IMG OID643565882 
Productproton-translocating NADH-quinone oxidoreductase, chain M 
Protein accessionYP_002462387 
Protein GI219847954 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.260264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGC CAGGCTTTCC GCTCCTCTCG CTCATCCTTT GGTTGCCGGC AGCCGGCGCC 
TTGGTACTGC TCTTTGTGCC GCGCGCCAAT GCCGAGCTGG CCCGCCGGGT ATCACTTGCC
ACGATGGCGG TCGTCTTTTT GCTCTCGCTC CTGTTGCCGC TGCGGTTTGA AACAAATCCG
CTCCAGACAA CTGTCGTCGG TGCATCGCCG GTCATGCAAT TTGTCGAAGA AGTGCCTTGG
TTGCCGATTG TTGGCGCAAC TTATAGTTTA GGCATTGATG GGATCAGCCT CTGGTTGGTC
ATGTTGACCA CCTTTTTGGG GCCAATCGTT GTGCTGTCGA CATGGGATTC GGTGCATAAA
GATGTGCGTA ACTTCCAGAT TCTGCTGCTG ATTTTACAGA CGGCCATGAT CGGCGTCTTT
CTCGCGCAGG ATCTCTTGTT ATTCTACCTG TTTTGGGAGT TTACCCTTAT CCCGATGACC
TTCTTGATCG GTATTTGGGG GAGTCAGAAC CGGATTTATG CAGCACGTAA GTTTTTTCTC
TACACATTTG CCGGCTCAGT TTTCATGCTG TTGGCGTTAA TTGCGTTGCA TATCCTGCAC
CGTAATGCGA TTGCCGAAAT TGAGCCTGGA TTTCGCGGTA CCTTCAGTTT TAGCCGGTTT
GTTAGTGATT TGCGCGCCGG TCGGCTGACC CTTGATAGTC TCACCGAGCG ACTGCTGTTT
GGCGCATTTT TCCTGGCCTT TGCCGTCAAA GTACCGCTGT GGCCGTTCCA TACGTGGTTA
CCCGATGCCC ACGTTGAAGC GCCGACCACC GGTTCGGTGG TACTGGCAGG GGTGTTGTTG
AAGCTGGGCG GCTACGGCAT GATTCGCTAC AATTTGACGC TCTTCCCGGC GGCCTCTCAG
TGGGCAGCAC CGGCACTGGC GATACTGGCC GTAATCGGTA TTATTTACGG CGCGGCTGTT
GCCTTTGCTC AATCAGACAT GAAGAAGTTG GTCGCCTATT CGTCAGTGAG CCATATGGGG
TTCGTTGTCC TGGGAATCTT TGCCCTCAAC ACTGAAGGAA TTAGCGGTGC TGTGTTGCAG
ATGGTCAATC ATGGTCTCAG CACGAGTGCG CTCTTTTTGA TGGTCGGTGT GCTCTATGAA
CGGCGACATA CGCGCGAATT GGCAGCCTAT GGCGGCTTGT GGAAGGTAAT GCCGGTCTTT
GCCGCTTTCA GTCTGCTGGT TGCGCTTTCG TCGGCCGGTC TGCCGGGTCT CAACGGTTTT
GTTGGTGAGT TTACGATCAT CACCGGCGCA TTCCGTTCAC CTTTGCTAGG ATGGATCTAC
GTTGCCTTTG CCGTCGGCGG TGTTGTATTG GCCGCTGCGT ATCTGCTCAA ACTCTTCCGC
TCGATCTTTA TGGGTGAGGT ACATCAGCCG GATAATACGA AGCTGCCCGA TTTGAATCGG
CGTGAGCTAA CGACATTTGC GCTTTTGAGC ATTCCTATCG TATTGATCGG CATCTATCCG
GTGTTCTTCT TTAATGGAAT GCAGTATAGT GTGGCTGCAC TCGTAGCAGA TTTGATGGCG
CAAGTGGCAG GGAGTTGA
 
Protein sequence
MNQPGFPLLS LILWLPAAGA LVLLFVPRAN AELARRVSLA TMAVVFLLSL LLPLRFETNP 
LQTTVVGASP VMQFVEEVPW LPIVGATYSL GIDGISLWLV MLTTFLGPIV VLSTWDSVHK
DVRNFQILLL ILQTAMIGVF LAQDLLLFYL FWEFTLIPMT FLIGIWGSQN RIYAARKFFL
YTFAGSVFML LALIALHILH RNAIAEIEPG FRGTFSFSRF VSDLRAGRLT LDSLTERLLF
GAFFLAFAVK VPLWPFHTWL PDAHVEAPTT GSVVLAGVLL KLGGYGMIRY NLTLFPAASQ
WAAPALAILA VIGIIYGAAV AFAQSDMKKL VAYSSVSHMG FVVLGIFALN TEGISGAVLQ
MVNHGLSTSA LFLMVGVLYE RRHTRELAAY GGLWKVMPVF AAFSLLVALS SAGLPGLNGF
VGEFTIITGA FRSPLLGWIY VAFAVGGVVL AAAYLLKLFR SIFMGEVHQP DNTKLPDLNR
RELTTFALLS IPIVLIGIYP VFFFNGMQYS VAALVADLMA QVAGS