Gene Cag_1989 details

Gene Information

Locus tag: Cag_1989
Symbol:
ID: 3747368
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2525145
End bp: 2526851
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 47%
IMG OID: 637774526
Product: ribonuclease E and G
Protein accession: YP_380280
Protein GI: 78189942
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1530] Ribonucleases G and E
TIGRFAM ID: [TIGR00757] ribonuclease, Rne/Rng family
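
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2525145, 2526851)
print(length)                  # 1707
print(protein_length(length))  # 568
```

Both values agree with the Gene Length and Protein Length fields of this record.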


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA GTTCGACGAA GCAGTTGCTT ATGAATAAAA CCGGCGACGA AATTCAGGTT 
GCGCTGGTTG AAGAGGGACG ATTAGTAGAA TTAATTATTG AGCGTCCCGA AAGTCGCCGC
AGCATTGGCG ATATTTATCT TGGACGCGTC CATAAAGTGG TGGAGGGGCT GAAAGCCGCC
TTTGTTGATA TTGGGCAAAA GTCGGATGGC TTTTTGCACT TTTCAGATGT TGGCACCACA
AACGAAGATT ATCGCGCCCT TATTGAGGAC GATGATGACG ACGATGCCAT TATCGGCGAC
GATATTGAGA GCGATGAAGC AACCGGCCAG AACGATTCCG AATTTGATGA GGCAAGCGAT
GGCGAAACTA CGGTGAGTGC CCCAAAAACA GTGGCGCGTA GCAACAGCAA AAAGCCCTCA
TCAGGCGAGC AAAGTGGGGA AAAGCCACAA ACCTACACGC AAATGATTGC AGGTAAGCTC
AAGCCCAACG ACTCCATTTT AGTGCAGGTT ATTAAGGAGC CAATTAGTAG CAAAGGTTCT
CGCTTAACCT CCGACATTAC GATTGCTGGG CGTTTTATGG TACTCCTACC GTTTGGCGGT
GGGCAGGTTG CCGTATCGCG CCGAGTGGTA TCGCGCAAAG AGCGCTCGCG TTTAAAAAAG
TTAGTGCGCT CCATACTGCC CGAAGGTTTT GGCGCTATTA TTCGCACGGT TGCCGAAGAT
CAAGAAGAGG CGCTTCTCAA GCAAGATTTA GAAAAGCTGC TAACAAAGTG GAAGCAAATT
GAAGAAAAGC TACAAGATGC CACGCCACCG CAACTTATTT TTAAAGAGGA TACCATTATC
TCCAGCGTAT TGCGCGATTC GCTCACCTCT GACGTCAGCG AAATTGTGGC AAACTCGCCC
GCCATCTACA AAGAGACGCT TAATTACATT GAGTGGGCTG CTCCCGAAAT GGTAAAAAAT
GTAGCGCTTT ACCAAGGCAA GTTGCCACTT TTTGAAGGGT ACGCTATTGC AAAGGATGTT
GAATCCATTT TTTCGCGCAA AGTGTGGCTA AAGTCGGGCG GCTACATTAT TATTGAACAC
ACCGAAGCCA TGGTGGTTGT TGACGTGAAC AGCGGTCGCT ATGCTGCCAA GCGAGAGCAA
GAAGAAAATT CATTAAAAAC CAATCTTGAA GCGGCGCGTG AAGTGGTGCG CCAATTACGG
TTGCGCGATA TTGGGGGCAT TATTGTGGTT GATTTTATTG ATATGCTTGA TCCCAAAAAT
GCCAAAAAGA TTTATGATGC CGTAAAAACC GAGTTGCGCA ACGATCGCGC AAAGTCAAAC
ATTTTGCCAA TGTCGGACTT TGGCTTAATG CAAATTACCC GCGAACGAAT TCGCCCCAGC
CTTATGCAGC GCATGGGCGA TCAATGCCCT GCCTGTGGAG GTACGGGCAT TGTACAAGCG
CGTTTTACCA CCATTAACCA AATTGAGCGT TGGCTTCGCA AATATGCGTT GCAGCACCCG
CTTCGCTTTC AGCAGCTTGA TCTTTACGTA AGCCCAACGG TTTTAGAGCC ACTGCAAAAC
AGCGACATGA AAACCGAAAT GAAGTGGTTT TTGCAACACA TGCTTTTTGT TACCGTTAAA
GGCGATGAAA GCCTTCGTAG CGATGACTTT AGATTTTACA ATCGCAAAAA CAATAAGGAT
ATAACTGCCG AATATGGCGA GTTATAG
 
Protein sequence
MKKSSTKQLL MNKTGDEIQV ALVEEGRLVE LIIERPESRR SIGDIYLGRV HKVVEGLKAA 
FVDIGQKSDG FLHFSDVGTT NEDYRALIED DDDDDAIIGD DIESDEATGQ NDSEFDEASD
GETTVSAPKT VARSNSKKPS SGEQSGEKPQ TYTQMIAGKL KPNDSILVQV IKEPISSKGS
RLTSDITIAG RFMVLLPFGG GQVAVSRRVV SRKERSRLKK LVRSILPEGF GAIIRTVAED
QEEALLKQDL EKLLTKWKQI EEKLQDATPP QLIFKEDTII SSVLRDSLTS DVSEIVANSP
AIYKETLNYI EWAAPEMVKN VALYQGKLPL FEGYAIAKDV ESIFSRKVWL KSGGYIIIEH
TEAMVVVDVN SGRYAAKREQ EENSLKTNLE AAREVVRQLR LRDIGGIIVV DFIDMLDPKN
AKKIYDAVKT ELRNDRAKSN ILPMSDFGLM QITRERIRPS LMQRMGDQCP ACGGTGIVQA
RFTTINQIER WLRKYALQHP LRFQQLDLYV SPTVLEPLQN SDMKTEMKWF LQHMLFVTVK
GDESLRSDDF RFYNRKNNKD ITAEYGEL
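
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing, compute GC content, and translate the gene sequence. It is not the database's own tooling: the names `clean`, `gc_content`, and `translate` are hypothetical. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares. Alternative start codons are not handled specially, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT characters
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAAAAAA GTTCGACGAA GCAGTTGCTT"
print(translate(first_blocks))  # MKKSSTKQLL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.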