Gene Cagg_1038 details


Gene Information

Locus tag: Cagg_1038
Symbol:
ID: 7268410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1286623
End bp: 1288551
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 56%
IMG OID: 643565883
Product: proton-translocating NADH-quinone oxidoreductase, chain L
Protein accession: YP_002462388
Protein GI: 219847955
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
TIGRFAM ID: [TIGR01974] proton-translocating NADH-quinone oxidoreductase, chain L
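
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 1286623
END_BP = 1288551
GENE_LENGTH_BP = 1929
PROTEIN_LENGTH_AA = 642


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)        # True
print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the fields agree: 1288551 - 1286623 + 1 = 1929 bp, and 1929 / 3 = 643 codons, of which one is the stop codon, leaving 642 aa.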


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACTGT TAATCTGGCT GATACCGCTG CTGCCATTTA TCGGTTTTTT GTTAAACGTG 
TTTGTGATTC GTCGCGAGCG TGAGGCAGGA TTGGTGGCGA GCGGCATGGT TGCGGCAGCG
TTTGTCGTGA CGCTGATCGC AGTCGGTATG TTGGCCGGCA TGCCACCGGA AGAGCGACGG
ATTGTGAGTA CGGCTTGGGA ATGGATCAGT ACCGGTAGTT TTCGAGTGCC GTTTGCCGTG
ATGTTCGATC CGCTGACGGC AGTCATGGCG CTGTTGATTA CCGGTGTTGG TGCGCTCATT
CATGTCTACT CGATCGGCTA TATGCACGGC GATCCGCGGG TGGTGCGCTA TTTCGCCTAC
CTCAACTTGT TCGTCACTAT GATGCTCTTT CTGGTGATGG CAAACAATTT GTTGCTGCTC
TTCCTCGGGT GGGAAGGCGT TGGCCTGTGC TCGTTCTTGT TGATCGGGTT CTGGTTCGAG
CGCAAATCGG CCAGTGAAGC GGCAGTCAAA GCGTTTGTCG TCAACCGAAT TGGTGATGCG
GCGTTTATTT TGGCGATGTT GGCCATCTTT GCCTATTTTG GCACCCTCAA CTTTTACGGC
GATGGCGAAA GTGGTCAACT CGGTTTTCTC GAGCGGGTGG GGGATATTGC CGGTCTGAAG
ATTGGCCCTA CGTGGCAGCC GGTCTTTTTG AGTACCGTTA TCTCGTTTCT CTTACTAATT
GGCGCAACCG GTAAGAGTGC GCAATTTCCC CTCTTCGTCT GGCTCCCTGA TGCAATGGCC
GGTCCTACAC CGGTGTCGGC GCTCATCCAC GCCGCAACCA TGGTGACCGG TGGGGTGTAC
CTGATGGCGC GTACCGAACC GCTCTTCGTG GCCTCGTTTA CAACGCAAGG CTGGGTGGCA
TGGATCGGGG CGTTGACTGC GCTGCTCGCC GGTACGGCGG CAATGGCTCA ATGGGATATT
AAGCGGGTGC TGGCGTATAG TACTGTCTCC CAGTTGGGTT TTATGGTGGC AGCGTGTGGT
ATGGGAGCGT ATGTCGCTGC GATTTTTCAC TTGTTAACCC ACGGGATTTT TAAAGCGCTG
CTCTTTCTGG CTGCCGGTTC GGTCATCCAT GGGACGCATG ATACCCAAGA TATGCGGCGT
ATGGGTGGCT TGAAGGATGC AATGCCAATC ACCTTCCGTA CCTATCTGAT CGGGGCGTTG
GCGCTGGCCG GTATTGTCCC GTTTGCCGGC TTCTGGAGTA AAGATGAAAT ATTGGCGCAC
GCGGTGAGTC ATGGGCACAC CCCGATCTTT CTGATCCTCT TCCTCACCTC GCTGCTCACG
GCCTTCTATA TGGGCCGGCA GATCGCGTTG GTCTTCTTCG GGACACAACG TGATCCGAGC
TATCATCCGC ACGAAAGTCC GTCGGTGATG ACGGTACCGC TGATCGTGTT GGCTGTGGGG
GCGGTGATTG GTGGTGCCAT CAATCTACCG GTGTTGCACT GGTTGACCGA CTGGCTCGAA
CCGGTGTTGC ATGAGCAGGC CGGTGAGTTC AATCTGTGGC TAGCGTTGGT AGCCACTATC
GGTGCGGTCG GCATGGGCTA TCTTGGCTGG TGGGTCTACA CCGTCAATGC GGCTAAGATC
AAGATCGGCG GCAAAGACCC GGCCTACCGC TACAGCGGTG ATATTTGGGA AGGGATGGAG
GAAGCGTGGT ACCTTGATCG CTTCTACCAG CGCACAGTGG TCGCTGGCTT CGAGCGGCTG
GCCGATTTTC TGGCCCGCGT GTTCGATCCG CAGGGTGTTG ATGGATTGGT GATGGGTATT
GGCCGCTTCT TTGGTAGTTT GGCCAATGGG GTGCGTGCCT TGCAAACCGG GTATGTACGC
ACCTATGCGC TCGTCTTCAC CGTTGGTGTG TTACTCGTTC TTGGCTTTAT GCTCTGGTTT
GCCCGCTAG
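
A GC percentage such as the one in the GC content field can be recomputed from the sequence text. This is a minimal sketch that assumes GC content means the share of G and C bases among all bases; how the database derives and rounds its own figure is not stated in the record, and `gc_percent` is an illustrative name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks
    used to group the sequence into blocks of ten."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the whole gene sequence into a string and call gc_percent on it.
# The short string below is only a toy input.
print(gc_percent("GGCC AATT"))  # 50.0
```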
 
Protein sequence
MELLIWLIPL LPFIGFLLNV FVIRREREAG LVASGMVAAA FVVTLIAVGM LAGMPPEERR 
IVSTAWEWIS TGSFRVPFAV MFDPLTAVMA LLITGVGALI HVYSIGYMHG DPRVVRYFAY
LNLFVTMMLF LVMANNLLLL FLGWEGVGLC SFLLIGFWFE RKSASEAAVK AFVVNRIGDA
AFILAMLAIF AYFGTLNFYG DGESGQLGFL ERVGDIAGLK IGPTWQPVFL STVISFLLLI
GATGKSAQFP LFVWLPDAMA GPTPVSALIH AATMVTGGVY LMARTEPLFV ASFTTQGWVA
WIGALTALLA GTAAMAQWDI KRVLAYSTVS QLGFMVAACG MGAYVAAIFH LLTHGIFKAL
LFLAAGSVIH GTHDTQDMRR MGGLKDAMPI TFRTYLIGAL ALAGIVPFAG FWSKDEILAH
AVSHGHTPIF LILFLTSLLT AFYMGRQIAL VFFGTQRDPS YHPHESPSVM TVPLIVLAVG
AVIGGAINLP VLHWLTDWLE PVLHEQAGEF NLWLALVATI GAVGMGYLGW WVYTVNAAKI
KIGGKDPAYR YSGDIWEGME EAWYLDRFYQ RTVVAGFERL ADFLARVFDP QGVDGLVMGI
GRFFGSLANG VRALQTGYVR TYALVFTVGV LLVLGFMLWF AR
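
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon map from the usual compact amino-acid string in TCAG base order and translates until the first stop codon. It assumes the reading frame starts at the first base, and it does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGAACTGT TAATCTGGCT GATACCGCTG"))  # MELLIWLIPL
# Last 9 bases: two residues followed by the TAG stop codon.
print(translate("GCCCGCTAG"))  # AR
```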