Gene Cagg_1011 details


Gene Information

Locus tag: Cagg_1011
Symbol:
ID: 7268383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1249782
End bp: 1251008
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 56%
IMG OID: 643565857
Product: MazG family protein
Protein accession: YP_002462362
Protein GI: 219847929
COG category: [R] General function prediction only
COG ID: [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain
TIGRFAM ID: [TIGR00444] MazG family protein
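
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS is complete with a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1249782
END_BP = 1251008


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a complete CDS: codons minus one stop codon.

    Assumes the gene is unspliced and ends in exactly one stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")  # 1227 bp; 408 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.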


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACCA CACTCCATAC TGTGCTTCAT CTTGCGGTGA ATCAGGACCT GATCGACCCA 
GCAGCATTAC AGGTGTGGTC GGTTGAACGG TTGCTTCAAC CATCACCCCG GCCAACAGCA
ACCAGGGTGT TACCATGGGT AGAACAGCAA GGTTTGGGTA GCTATCAACC GGTACGATTG
CCATACCCCC TGGCAACACA TACCCCGGCT CTGATCTGGG GTGAACCGGC TACTCTCAAT
CTGACCGCAC TGGCTACCCT TTTGGCTGAA CGTTACCCGA CCAACCATCG GTTGTTTGTG
TTGACCGCAC CGGAGGGGAG TACGATACCG TTGACCGTGG CCGAACTCGC TACGGCAATC
CTGCCACCAG ATGAAGTGAT CGGGATCGTT GTTCCGGCAT TGTCGCTTGC CGAAGACCAA
CGGAGCCTTG ATCGATTGCG GTGGGTGATC GGTCGTCTGT GCGGGCCAGA CGGCTGCCCA
TGGGATGTAC GCCAAACCCA TCAGAGCCTG CGGAGAACGT TCCTCGAAGA GGTGTATGAA
GCGCTGGAAG CAATCGATAC CGGCGACATG CGTCACCTCT GTGAAGAGCT TGGTGATGTG
CTGATGCAAG TATTCGTGCA TAGTGAAATG GCCCGCCAGG CCGGCTATTT TACCCTCGAA
TCGGTCGTTC AGCACGTCGC CGATAAACTG ATCTTTCGCC ATCCGCACGT CTTCGGCACA
ACCAGTGTGA CCGACACCGG TGAAGTTCTC CAAAACTGGG AGGCGTTGAA GGCGCAAGAA
TTGGCTACTA AAGGCCAGGT ACGTAGCAGC GCGCTCGATG GTATTCCGTC AGCATTGCCA
GCATTGGCCA CTGCCCAGAC GCTGGCGCGT AAGGCAATCC AAGCCGGGTT TACGTGGACG
ACAATTGAGC AAGTTTGGGC CAAAATTGCC GAAGAACTGG CCGAGTTACG CGAAGCTGAT
GATAGTGCGG CCCAGAAGCG GGAACTCGGT GATCTGCTGT TCGCGCTGAC CATACTGGCC
CATTGGCTCC AACTCGATGC AGAATCGGCC CTGCGTGAAG CAAATCTGCG GTTTAAACAA
CGGTTTCAAC AGGTCGAACA GATGGCTGCT CGTTCTGGAC GGAACCTGCG CGATTGCACA
CTCGATGAAC TGATCGCATG GTGGACGGCA GCGAAGATGA TGAGGAACGA ACACACCGAT
GGCACCACCA ATTCCGCTGT ACCGTAA
 
Protein sequence
MSTTLHTVLH LAVNQDLIDP AALQVWSVER LLQPSPRPTA TRVLPWVEQQ GLGSYQPVRL 
PYPLATHTPA LIWGEPATLN LTALATLLAE RYPTNHRLFV LTAPEGSTIP LTVAELATAI
LPPDEVIGIV VPALSLAEDQ RSLDRLRWVI GRLCGPDGCP WDVRQTHQSL RRTFLEEVYE
ALEAIDTGDM RHLCEELGDV LMQVFVHSEM ARQAGYFTLE SVVQHVADKL IFRHPHVFGT
TSVTDTGEVL QNWEALKAQE LATKGQVRSS ALDGIPSALP ALATAQTLAR KAIQAGFTWT
TIEQVWAKIA EELAELREAD DSAAQKRELG DLLFALTILA HWLQLDAESA LREANLRFKQ
RFQQVEQMAA RSGRNLRDCT LDELIAWWTA AKMMRNEHTD GTTNSAVP
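
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked for GC content. It assumes the sense strand is shown and uses the codon assignments of translation table 11, which match the standard code for internal codons. It does not handle the alternative start codons that table 11 permits, which is harmless here because the gene starts with ATG. All helper names are hypothetical.

```python
from itertools import product

# Codon assignments for translation table 11, in the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGTACCA CACTCCATAC TGTGCTTCAT"
print(translate(first_blocks))  # MSTTLHTVLH
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence block and the GC content field of the record.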