Gene Cag_0813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0813 
Symbol 
ID3747467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1138419 
End bp1140074 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content46% 
IMG OID637773343 
Productpeptidase S41A, C-terminal protease 
Protein accessionYP_379122 
Protein GI78188784 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.338188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGTA TTGTTACTGT TGCCCTAATG CTTGTAGTGC TTTTGTTTGG GATTTTTCTT 
GGAACCCGAA TAAGCGGTAG GCGTGTTGAG AGTAGCGGTG CTGGTCAACA AAAAATTGTT
GAGGCATATA ATCTTATGCG GCAGTTTTAT GTTGATGAGG TGGGTGGTGA TAGTTTAGCG
GGGGCTGGCA TTGAGGGTAT GGCTGGATTG CTTGATCCCC ATACGGTTTA TTTGGAGCCT
GAAAAGGTTA CTTATGAGCA GGCTGAGTTT GAAGGCAATT TTGATGGCAT TGGTATTGAG
TTTGATATTG TTAACGATAC GCTGTTGGTG GTTACTCCGC TTTCGGGCGG ACCAAGTGCG
GCGGTAGGTT TGGCTTCGGG CGATAGAATT ATTGGTATTG ATGGCGTTTC GGCAATTGGT
ATTACCCAGC GTGATGTGTT GAAAAAGTTG CGTGGCAAGC AAGGTTCAAT GGTTCAGCTT
GATGTGTTTC GTCCGCTTGA TGGTAAGCGT ATGGACTTTT CGGTAACGCG TGGCAAAATT
TCCACGTCAA GCATTGAAGC GGCGTTTATG GTGAACCAGC AGGTGGGGTA TATTCGTTTA
AGTCGTTTTA TTGCGACAAC GGCTGATGAG TTTCGTAGTT CCCTTCAGCT TTTAAAACAG
CAAGGCATGA AGCGCTTGCT GCTTGATATG CGTGGCAATC CCGGTGGTTT TCTTGAGCAA
GCCGTAGCCG TAGCCGATGA GTTTTTAAGC GAGGGCAAAT TGATTGTTTA TACAAAGAGC
CGTAAGGGAA GTTTGCCTGA TGAGCGTTAC GAGGCTCGTT CGGGCGATAC TTTTGAGCGA
GGCGATGTGG TGGTGCTTAT TGACCGTGGA AGCGCCTCAG CGGCTGAAAT TGTTGCTGGA
GCGTTGCAAG ATAATAAGCG GGCTGTGGTG GTTGGTGAGC CTTCGTTTGG CAAAGGGTTG
GTGCAGCAGC AATTACCCTT TGCGGATGGT TCGGCGTTAC GCTTAACCGT TGCAAGGTAC
TACACACCTT CGGGGCGCCA AATTCAGCGG GTTTATCGCA AAGGTGTGGC TGGTCGTGAG
CACTATTTTG AAGAGAGTAT GAGCAACATT TCACCCAACA AACTTTTTGA TGATCCTGAT
ACCTTGCTTT ATTATGAGAA TAACAATGTG TCGGTTTACA ACACCTCCAC ACTGCCATCA
TTATTGCTCT CGTTAAAAGG CAAAAAAGGT GAAAATAATC GTCTTACTGA CTTGCGCGAT
GCTGGGGGTA TTATTCCAAA TTATTGGGTT AACGCAAGGA GTTATTCTTC ATTTTATCAA
GAGCTCTATC GCACAGGATT GTATGATGAA GTAGCTCGTA AACTGCTTGA TGATCCTCAC
TCGTTAGTGC AGAAGTATCG TGATTCGCTT GAGCGTTTTA TGACAAGTTA TACCGAAGAG
CCAAACTTTG AAGCCTTGCT TGCTAAAGCG TGCCAATCAA AAGGTGTTCG ATTTAATCGT
GTGGCGTTGC TGCAAGATCG TCATGCCATT GTGTTGGCGC TTAAAGGACG CATGGCACAC
CAACTTTTTG GCTCAAGTGG TCAAATAAAA TTTTATGTTA AAACTGCTGA TCCACTTGTT
CGAGTGGCAA CTTCAGTTCC GCTTTCAACT CGCTAA
 
Protein sequence
MSRIVTVALM LVVLLFGIFL GTRISGRRVE SSGAGQQKIV EAYNLMRQFY VDEVGGDSLA 
GAGIEGMAGL LDPHTVYLEP EKVTYEQAEF EGNFDGIGIE FDIVNDTLLV VTPLSGGPSA
AVGLASGDRI IGIDGVSAIG ITQRDVLKKL RGKQGSMVQL DVFRPLDGKR MDFSVTRGKI
STSSIEAAFM VNQQVGYIRL SRFIATTADE FRSSLQLLKQ QGMKRLLLDM RGNPGGFLEQ
AVAVADEFLS EGKLIVYTKS RKGSLPDERY EARSGDTFER GDVVVLIDRG SASAAEIVAG
ALQDNKRAVV VGEPSFGKGL VQQQLPFADG SALRLTVARY YTPSGRQIQR VYRKGVAGRE
HYFEESMSNI SPNKLFDDPD TLLYYENNNV SVYNTSTLPS LLLSLKGKKG ENNRLTDLRD
AGGIIPNYWV NARSYSSFYQ ELYRTGLYDE VARKLLDDPH SLVQKYRDSL ERFMTSYTEE
PNFEALLAKA CQSKGVRFNR VALLQDRHAI VLALKGRMAH QLFGSSGQIK FYVKTADPLV
RVATSVPLST R