Gene Cagg_3122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3122 
Symbol 
ID7269540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3782184 
End bp3784094 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content60% 
IMG OID643567943 
Productmagnesium chelatase ATPase subunit D 
Protein accessionYP_002464416 
Protein GI219849983 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1239] Mg-chelatase subunit ChlI 
TIGRFAM ID[TIGR02031] magnesium chelatase ATPase subunit D 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACAAA GAGTGAAAGA ACTACCGACA GGGCCACTTC CCTTCACGGC AATCGTCGGC 
CTCGAAGCAG CCCGACAGGC CCTGCTCCTG TTGGCAGTCG ATCCGCTGCT GACCGGTGTC
GCGATTGGTG CAGGAGCAGG TACCGGTAAA AGTGCGTTGG TGCGCGCCTT TGCCCGTATG
CTGGCCGGTG GGCGTGAGTT TGACCCCACC TTACCCTGCA ATTTGGTCGA GATGCCGGTC
GGCGTAAGCG AAGACCGACT CTTGGGTGGA ATTGACATCG AAGCAACTCT CGCGTTGGGC
GAACGGGTCC ATCGGAGTGG CCTCCTCGCG CGCGCCAACG GCGGCTTGTT GTACGTTGAT
AGCGTCAACT TGCTCGACGA TAGCACGATC AACCACATTC TCGGTGCGTT GGATAGCGGC
GTGGTGCGGG TCGAGCGAGA AGGAATATCC GTTGTGGAAC CGGCCCGTTT TGTGCTGCTC
GTCACATATG ACCCGGCGGA GGGGCCTCCT CGTCGCCATT TGCTCGACCG CCTAGGGTTG
ATCGTAGCGC CGATCGGTAA AGCCCCGGTG ACGACGCGGG CCGAGGTTGT GCGTCGCAAC
CTCCAACCCC ACCTCGATTA CGAAGATGAT GAGGCGCTGG TGTTAGCCGG CATTCTGGCG
GCACGTGAAC TGTTACCCAA CGTCACCATC ACCGATGATC AGATTCGGCA ATTGAGTCTG
ACAGCCCTAG CCCTAGGAAT CGAAGGGCAT CGGGCCGATA TGTTCGCGGT GCGGGCAGCA
CGGGCAGCAG CCGCCCTGGC CGGACGTGAT GAGGTGAGTA ACGAAGACCT TGAGCTGGCC
GTGCGGTTGG TGATGCTGCC GCGTGCTACA CGGCTACCAG AAATGACACC GGCAGAATCT
CAACCACCAC CACCAACCCC AGAACCGGCC CCACCACCAC CGAGTCAGCA ACAAGAAGAC
GACGAGCAGA ACAACGATGA CGATCAACCA CCGACGCCAC CAGACGAAGA GTTGACGGTC
GAGGATCTCA TCCTGGCCGC AATGGAGACG GAAGTACCGC CGGACATCCT CGAAACGCCG
TTTACCGTGC GCCGACGCGG GCGGAGTGGT TCACGTGGCA CCATTTCCGG ACAACGCGGT
CGCCATATTC GCTCGGTACC GGGGAATCCG GCTCAAGGGC GGCTCGATGT GATTGCCACA
CTCCGTGCCG CCGCACCGTG GCAACGGCTA CGAGCAAGTG ACCATCCGCC ACATCAGCAT
CGGCGTGGAC GCATTCACTT GCGTGCCGAA GACTTACACA TCAAGAAATA CCGTTCTAAG
GCGGGGACAC TCTTCTGTTT TCTGGTTGAT GCCAGCGGTT CAATGGCTCT GCACCGCATG
CGACAGGCGA AAGGCGCCGT CAATTCCCTC TTGCAGCAAG CCTACGTTCA CCGCGATCAG
GTGGCGTTGC TGGCTTTCCG TGGTGAGCGG GCCGATCTGC TCCTTCCTCC ATCACAAAGT
GTCGAACTGG CCAAACGCGC CCTCGACGTG TTGCCAACCG GTGGAGGAAC CCCGCTCGCA
GCGGCGCTAT TGGCGGCGTA CCAGATCAGT GAGCAAGCAC GGGCACGTGG TATTTTCCGC
ACTACCATCG TGCTGATCAC CGATGGGCGA CCGAATGTAC CGCTTAAGGC CGATCCCACG
ATGGACAAAA ATCGCCGGCT TGAGCAAGCT CGTCAAGAGG TACAGCAACT AGCCGGTCGG
CTGCGCGCTG CCGGTGTCGG TGCTGTGGTC ATTGATACCC AGCGCAGTTT CGTTTCACGG
GGTGAAGCCC AGCAATTGGC GGTATGGCTC GGTGGGCGCT ACGTATATCT GCCAAATGGA
CGAGGGGATC AGATTGCAAA TGCCGTCATT GCGGCCAGCG AAGAGATGTA G
 
Protein sequence
MVQRVKELPT GPLPFTAIVG LEAARQALLL LAVDPLLTGV AIGAGAGTGK SALVRAFARM 
LAGGREFDPT LPCNLVEMPV GVSEDRLLGG IDIEATLALG ERVHRSGLLA RANGGLLYVD
SVNLLDDSTI NHILGALDSG VVRVEREGIS VVEPARFVLL VTYDPAEGPP RRHLLDRLGL
IVAPIGKAPV TTRAEVVRRN LQPHLDYEDD EALVLAGILA ARELLPNVTI TDDQIRQLSL
TALALGIEGH RADMFAVRAA RAAAALAGRD EVSNEDLELA VRLVMLPRAT RLPEMTPAES
QPPPPTPEPA PPPPSQQQED DEQNNDDDQP PTPPDEELTV EDLILAAMET EVPPDILETP
FTVRRRGRSG SRGTISGQRG RHIRSVPGNP AQGRLDVIAT LRAAAPWQRL RASDHPPHQH
RRGRIHLRAE DLHIKKYRSK AGTLFCFLVD ASGSMALHRM RQAKGAVNSL LQQAYVHRDQ
VALLAFRGER ADLLLPPSQS VELAKRALDV LPTGGGTPLA AALLAAYQIS EQARARGIFR
TTIVLITDGR PNVPLKADPT MDKNRRLEQA RQEVQQLAGR LRAAGVGAVV IDTQRSFVSR
GEAQQLAVWL GGRYVYLPNG RGDQIANAVI AASEEM