Gene Cag_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1119 
Symbol 
ID3748337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1509145 
End bp1510434 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content49% 
IMG OID637773650 
ProductFmu, rRNA SAM-dependent methyltransferase 
Protein accessionYP_379424 
Protein GI78189086 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases
[COG0781] Transcription termination factor 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
[TIGR01951] transcription antitermination factor NusB 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACTG CACGAGAAGT AGCGCTGCAA GTGCTGCAAG CGTTAGAAAA AAGTAGCGAG 
CGCTCCGATG CGTTGTTGCA CAAACATCTT GAAGCGTCGG GATTGGAGCG CGTTGAGCGG
GCACTTGCCA CCGAGTTGGT GAATGGCGTG TTGCGTTGGC AGCTTCAGCT TGATAGCCGC
ATTAGCCTTG CTTACCATCA CAAGCTTGAG GCAGCGGCTC CTGTATTGCG CAATATTTTA
CGATTAGGCG CCTATCAGCT TCTTTTTTTA ACCAAAATTC CTCGTTGGGC GGCGGTAAAC
GAGTGTGTAA AACTTGCTCG AAAGTATAAG GGTGAGCGAA TGGCAAAGTT GGTTAATGGT
GTGTTGCGCC ATCTTGATGG TGGCAATGAG GCGTTTGAAA AGCTGTTGCA AGGGCGTTCA
CAAGCCGAGC AACTTGCACT CCAATTTTCG CATCCCGCAT GGTTAATTGA GCGTTGGCTT
GCTACGTATG GCGAGCTGAA AACCCGACAA TTGTTGCATT ACAACAATCA AGCGCCAATG
ATGGGTTTTC GCATTAATCG GTTGAAAGCT GATGCAAAGG ATTTTTTTGA TAATCCTACC
TTTGCGGCGG CAATGGAGCC GTGCGAATTG CCTTATTGTT TTCTTTCGCG AGAATTTTCG
CTTTTTGAGA CGGCGTTGCA AGAGGGCGTT TTAAGTGTGC AAAATCCCAC GCAAGCGTTG
GCGCCGTTGT TGCTTAATCC AGCCCCACAG AGTGTGGTGA TTGACCTTTG TGCGGCACCG
GGCGGTAAAG CAATGTTTAT GGCGGAATTA ATGCACAATC AAGGGGAGGT GATTGCTGTT
GATCAGTATG AGCAAAAGTT ACGGAAGCTT GAAAGCCATG CGCAGGCGTT GGGGCTTACT
ATTATTCGCA CGGTAGCTGG CGATGCTCGG AGTGTTGCGC CAAATCTGCA AGCTGATGCG
GTGTTGCTTG ATGCTCCTTG TTCGGGCACG GGCGTTATTG GGCGTCGTGC TGAGTTACGC
TGGAAGGTAA CGCCTGCAAT GGTTGCGGAA TTGCAAGGGT TGCAAGCTGA ATTATTGGAT
CATGCGGCTA CGTTGCTTAA GCCTCAAGGT AGGTTAGTGT ATGCTACTTG TTCTATTGAG
CCTGAAGAGA ATGGTGAGCA AGTTGCAGCT TTTTTGCAAC GTCATCCTAA TTTTGTTGCC
GAACCTCATG CCCAGCACAT GACGTTGCCG GGCGATCGTT ATGGCTACGA TGGCGGGTTT
TGCCAATGCC TCCGTAAACT GCAAGAATAA
 
Protein sequence
MSTAREVALQ VLQALEKSSE RSDALLHKHL EASGLERVER ALATELVNGV LRWQLQLDSR 
ISLAYHHKLE AAAPVLRNIL RLGAYQLLFL TKIPRWAAVN ECVKLARKYK GERMAKLVNG
VLRHLDGGNE AFEKLLQGRS QAEQLALQFS HPAWLIERWL ATYGELKTRQ LLHYNNQAPM
MGFRINRLKA DAKDFFDNPT FAAAMEPCEL PYCFLSREFS LFETALQEGV LSVQNPTQAL
APLLLNPAPQ SVVIDLCAAP GGKAMFMAEL MHNQGEVIAV DQYEQKLRKL ESHAQALGLT
IIRTVAGDAR SVAPNLQADA VLLDAPCSGT GVIGRRAELR WKVTPAMVAE LQGLQAELLD
HAATLLKPQG RLVYATCSIE PEENGEQVAA FLQRHPNFVA EPHAQHMTLP GDRYGYDGGF
CQCLRKLQE