Gene Cag_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1991 
Symbol 
ID3747370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2527749 
End bp2529497 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content50% 
IMG OID637774528 
Productpeptidase S41A, C-terminal protease 
Protein accessionYP_380282 
Protein GI78189944 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCCGC AAAGAGAGAG CAAGCCGCGC CATAAGCAAA GCCAACGCAA CGGCTGGCGA 
ATTATTCAAC GCATGGCAAC GGCACTGCTT GCACTCTCTC TGCCTACCAC CACGCTTGCT
TACCCTCAAG CGGAAAGCCA AAGCTTTGCA GTTGTTTCAA GCATTGAGCT TCTCTCCGAA
GTATATCGCG AATTAGCAGC AGGCTACGTT GAGCCGCTTG ATACCGCCCT CTTAATGAAA
ACGGGCATTC GAGGCATGTT GCGTAGCCTT GATCCCTACA CCACCTTACT TGAGCGCGAT
GATGCCGATG AATTAGCCGA TATTACTCGT GGACGCTATG TGGGCATTGG CATATCGCTC
GCTACGCTCG AAAAAAAGCT CTACGTTACC GCTGTTAATG AGGAGAGCCC AGCCGCAGCG
GCAGGCATTC GCACAGGCGA TGCCATTCTT GCCATTAACG AGGCGAAAGT TGCCAATATA
GCGGTTGATA GCTTAAGAAC TCTTTTGCAC GGCACCAATG GTTCACCCAT CACCTTTCAA
TTAGAGCGAC GAGGCAGCGC CCCACGAACA ACCACCGTAC AACGCCAATC GGTGCCGCTA
AAAAGCGTAC CATATTACGA ATTACACAAC AACATTGGCT ACATAGCGCT TGATGGCTTT
ACCACCCGCT CACCCCATGA AGTACGTAGC GCATGGCAGT CATTGCAACA GCAAGCAACC
GCTAACAAGC AACCCTTACG TGGCTTAATA GTTGATTTAC GCGACAACTC AGGTGGCTTG
CTTGATGCCG CATTGGAAAT TACCTCGCTT TTTGTGCCCA ACGGCAGCGA AGTGGTTTCC
ATTAAAGGGC GCTCTACCCA TAGCCATAGC ACTCTTAAAA CCACCACCGA GCCGTTAGAT
GCAACACTCC CCGTTGCGCT GCTGATTAAT GGCGATACCG CTTCGGCGGC TGAAATTGTA
GCTGGTGCTC TGCAAGATGT TGATCGAGCC ATTATTCTTG GCGAACGCTC TTACGGCAAA
GGCTTAGTAC AATCGGTAAA AAAACTCTCT TATGGCAACA CACTGAAATT TACCACAGCA
AAATATTACA CCCCTTCAGG GCGCCTCATT CAAAAAGAGC TGAAAAAAGA GAGCTCACCA
CACTCAACCA ACGCTGATAG CAAACAAGCT CTTGCCTCCG CAGTACCCGA TACAACACAA
CGCTTTTACA CCCGCAATCA CCGTATTGTG TATGGGGGAG GTGGAATTAT GCCCGATGTG
GAGATAAAGG AACCAGCCTC GCCCTACGTA ACCGCATTGC GCAAACGAGG GATGATTTTT
CTTTTTGCTA ATGAATGGTA CGCCACCCAT TCTGATGATG CTCCAGCCTC ATCCGCTTTG
CTACCAAGCC AAACGGAGCT GTTAGCGCAC TTTGAAAAAT TCCTTCAGCA AAAAGAGTTT
CGCTACACCA GCAATGCCGC AAAACGTTTA GAGGAATTAA AAAGCGCCAT GAAAGAGTCA
GGCAGAGAGA ATCCTGAAGC CTTACGCACT ATGGAGCGCG AAGTTGAACT TGCAGATACA
GAGGAGCGCA ATCGTGAAGC CAAGCAAGTA GCCGTAGCGC TTGAGTCAGC AATTTTGCGC
CATGCCAGCG AACACTTAGC ACGCCAAGCC GAACTTCGCC ACGATGCGCT TGTGTTGCAA
GCTGAAGAGC TGCTTATCTA TCCCGCCCGT TATCGTGCTA TGCTGAAAGC TTCAAGCACA
AGAAAATAG
 
Protein sequence
MFPQRESKPR HKQSQRNGWR IIQRMATALL ALSLPTTTLA YPQAESQSFA VVSSIELLSE 
VYRELAAGYV EPLDTALLMK TGIRGMLRSL DPYTTLLERD DADELADITR GRYVGIGISL
ATLEKKLYVT AVNEESPAAA AGIRTGDAIL AINEAKVANI AVDSLRTLLH GTNGSPITFQ
LERRGSAPRT TTVQRQSVPL KSVPYYELHN NIGYIALDGF TTRSPHEVRS AWQSLQQQAT
ANKQPLRGLI VDLRDNSGGL LDAALEITSL FVPNGSEVVS IKGRSTHSHS TLKTTTEPLD
ATLPVALLIN GDTASAAEIV AGALQDVDRA IILGERSYGK GLVQSVKKLS YGNTLKFTTA
KYYTPSGRLI QKELKKESSP HSTNADSKQA LASAVPDTTQ RFYTRNHRIV YGGGGIMPDV
EIKEPASPYV TALRKRGMIF LFANEWYATH SDDAPASSAL LPSQTELLAH FEKFLQQKEF
RYTSNAAKRL EELKSAMKES GRENPEALRT MEREVELADT EERNREAKQV AVALESAILR
HASEHLARQA ELRHDALVLQ AEELLIYPAR YRAMLKASST RK