Gene Cag_0047 details

Gene Information

Locus tag: Cag_0047
Symbol:
ID: 3747246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 52929
End bp: 53915
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 53%
IMG OID: 637772573
Product: methyltransferase
Protein accession: YP_378369
Protein GI: 78188031
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0275] Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis
TIGRFAM ID: [TIGR00006] S-adenosyl-methyltransferase MraW
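
The start, end, gene length, and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, and the helper names are illustrative only.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 52929
END_BP = 53915


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming one trailing stop codon and no frameshift."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 987
print(protein_length(gene_length(START_BP, END_BP)))  # 328
```

Both results match the listed Gene Length (987 bp) and Protein Length (328 aa).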


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.509185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACTGC ACGATACTTA TCACGATCCG GTGCTTGCGG CGGAGGTAGT TGCTACCCTT 
GTGCAGCGTT CGGGCATTTA CGTTGATGGC ACGCTTGGTG GTGGCAGCCA CTCCCTTGCG
CTGTTGCAAG CCCTGCAAGC GCAAGGGTTG CTTGAATCAT CTTTACTGAT TGGTATTGAT
CAGGATAGCG ATGCGCTGGC TATGGCTGCC GAGCGTTTAC AAGCGTGGCA ACCTTACACT
CGCTTGCTGA AAGGGAACTT TCGTGATATG GCTTCGCTTG TTCAGCAACT CTGCGATGCT
GAAGGGCGTG CTTGTGCCGT AACGGGCGTG TTGCTGGATC TTGGGGTCTC TTCGTTTCAG
CTTGATACGG CTGAGCGTGG TTTTAGCTAC ATGCGTTCAG GTCCGCTTGA TATGCGTATG
GATAACACGG CACCGCTTAC CGCGGCGGAG CTTATCAATC ATGCAGATGA AGCGGAGCTG
GCGCGTATTT TTTATCACTA CGGCGAAGAG CCTCGAAGCC GTGCGTTAGC GCGTGCGGTT
GTGCAGCAGC GCGAAAAAAT GGGCAATTTT ACAACCACCG AAGAGCTTGC AGCGTTAGTG
CGGCGCTTAA CGCATGGTGG CGAAAAAGCT GTTATTAAAA CGCTTTCGCG CCTGTTTCAA
GCCTTACGCA TTGCCGTGAA TGATGAACTT GGTGCTTTGC ATGAGGTGCT TGAGGGTGCG
CTTGAGTTGC TTGATGGCAA CGGACGTTTA GCCGTTATGA GCTATCATTC GCTTGAAGAT
AGGGTGGTGA AGCACTTTTT TACCCATCAT GCGCAATGCG ATTGGGGACC CAAAGGGGTT
GCGTTACGGG AGCCACTAAG CCAAGGTGCC CTCACCATTG TTACCAAACG CCCCATGCTT
GCCTCTGCCG ATGAAATTGA GCGCAATCCT CGCGCCCGAA GCGCAAAATT GCGAGTTGCT
GCCAAAAATC AGCCAAAAAC CATTTAA
 
Protein sequence
MALHDTYHDP VLAAEVVATL VQRSGIYVDG TLGGGSHSLA LLQALQAQGL LESSLLIGID 
QDSDALAMAA ERLQAWQPYT RLLKGNFRDM ASLVQQLCDA EGRACAVTGV LLDLGVSSFQ
LDTAERGFSY MRSGPLDMRM DNTAPLTAAE LINHADEAEL ARIFYHYGEE PRSRALARAV
VQQREKMGNF TTTEELAALV RRLTHGGEKA VIKTLSRLFQ ALRIAVNDEL GALHEVLEGA
LELLDGNGRL AVMSYHSLED RVVKHFFTHH AQCDWGPKGV ALREPLSQGA LTIVTKRPML
ASADEIERNP RARSAKLRVA AKNQPKTI
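
The gene and protein sequences above are printed in 10-character blocks, so they need whitespace removed before any analysis. The following is a minimal sketch, not the method used to produce this record, of how one might clean the blocks, compute GC content, and translate the CDS. It assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code for internal codons) and ignores alternative start codons; all function names are hypothetical.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G); amino-acid string for
# translation table 11, which matches the standard code for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = dict(zip(("".join(c) for c in product(BASES, repeat=3)), AMINO))


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(blocked: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(blocked)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(blocked)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGGCACTGC ACGATACTTA TCACGATCCG"))  # MALHDTYHDP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.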