Gene Cagg_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1046 
Symbol 
ID7268418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1296583 
End bp1297818 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content56% 
IMG OID643565891 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_002462396 
Protein GI219847963 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAC CGATCACGGT GCCGACGCCA CGGCAGATTA TCGAGCCGGC AGTGGCCGGT 
CGCACCGAGA CGATGGTGCT GAATATGGGG CCGCATCATC CGAGTACCCA CGGTGTGTTG
CGGCTTGTGT TGGAACTTGA TGGCGAAGTG GTGGTGAATG TCGCTCCCGA TGTCGGCTAT
CTCCATACCG GCATCGAGAA GACGATGGAG AGCAAGACGT ACCAGAAGGC GGTGGTTCTG
ACCGACCGCA TGGATTATCT GGCTCCGCTC TCGAATAATC TCTGTTATGC ACTGGCCGTT
GAGAAGTTGC TCGGTATTGA GATACCTGAA CGTGCCCAGA TTGCCCGTGT CCTGCTCACC
GAGTTGCAGC GGATCGCTTC GCATTTGGTC TGGCTCGGTA CGCACGCCCT CGATCTCGCG
GCAATGAGTG TCTTCCTGTA CGCCTTCCGT GAGCGTGAGC AGATCCTCGA TATTTTCGAG
TTGGTCTCCG GCGCGCGTAT GATGACCAGC TATTTTCGGA TCGGTGGGCT GGCTTACGAT
CTACCCGCCG ATTTTATTCC TACCGTCGAG CAGTTCCTCG CCATTATGCC GTCGCGTATT
GATGAATACG AGGATTTACT GACGGCCAAC CCGCTCTGGC TCGAACGGAC GGTTGGGGTC
GGTGTCATTG ATGCCCAATC GGCAATTGCA CTTGGGCTGA CCGGCGCTAA TCTGCGCGCG
ACCGGTGTTG CTTACGATGT GCGTAAGGCA ATGCCCTACA GTGGCTACGA GACGTATTCG
TTTGAGATTC CGGTAGGCAA GAACGGCGAT ATATACGACC GCTATCGCGT GCGGGTCGCT
GAAATGCGCC AAAGTGTGAA GATTGTGCAG CAAGCAACTG AACGATTGCG CGAGTTGGGG
CCTGGTCCCG TGATAACTCC TGATCGTAAA GTTGCTCCAC CACCCAAGCG CGAGATTACC
GAGAGTATGG AGTCGCTCAT TCACCACTTT AAGCTTTGGA CCGAGGGCTT TAAGCCTCCA
CGGGGCGATG CCTACGTAAG CATCGAATCG CCCCGTGGCA TCCTCGGTTG CTATGTGGTG
AGTGATGGTA GTCCCAAGCC ATGGCGTGTC CATTTCCGCG CGCCGTCCTT CATCAATCTA
CAGAGTCTGG CTCATATGGC TAAGGGTCGG CTGGTTGCCG ATTTGGTCGC ATTGATCGCC
AGCCTCGATC CCGTCCTCGG TGAAGTGGAT CGCTGA
 
Protein sequence
MTEPITVPTP RQIIEPAVAG RTETMVLNMG PHHPSTHGVL RLVLELDGEV VVNVAPDVGY 
LHTGIEKTME SKTYQKAVVL TDRMDYLAPL SNNLCYALAV EKLLGIEIPE RAQIARVLLT
ELQRIASHLV WLGTHALDLA AMSVFLYAFR EREQILDIFE LVSGARMMTS YFRIGGLAYD
LPADFIPTVE QFLAIMPSRI DEYEDLLTAN PLWLERTVGV GVIDAQSAIA LGLTGANLRA
TGVAYDVRKA MPYSGYETYS FEIPVGKNGD IYDRYRVRVA EMRQSVKIVQ QATERLRELG
PGPVITPDRK VAPPPKREIT ESMESLIHHF KLWTEGFKPP RGDAYVSIES PRGILGCYVV
SDGSPKPWRV HFRAPSFINL QSLAHMAKGR LVADLVALIA SLDPVLGEVD R