Gene Cag_0729 details

Gene Information

Locus tag: Cag_0729
Symbol:
ID: 3747425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1005202
End bp: 1007253
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 47%
IMG OID: 637773263
Product: peptidase S41A, C-terminal protease
Protein accession: YP_379043
Protein GI: 78188705
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
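
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1005202, 1007253)
print(length)                  # 2052
print(protein_length(length))  # 683
```

Both results agree with the Gene Length and Protein Length values listed in the record.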


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACCCT GCCGCGTTGA AGCGTTTTGG CCACTTGCCA CCTCGCGTGC AGCTTCAGCA 
ACCACGCTTC AGCCAACTGC CCTGCACGAT GAAACGGGCA AATATATTAG CCAAACGCTG
TTGCAGTACC ATTACCGTAA ACCAGCAACA AACGATTCGC TGTCACTGCA AATTTTTAAC
CGTTACTTGG AGCAGCTTGA TGGTAGCAAA AGCTACTTTG TGGCTTCGGA GGTGGAAAGT
TTGCGCAAAG TGTATGGCAC TCGCTTTGAT GATGAATTAC TTGCAGGGAA GTCGAAAAGC
GGCTTTGGCA TGTACAACTT TTTTTTAAAG CGTGCAAAAG AGAAAATGCG CTTTATGAAA
GCAACTGCCG ATACCGCTCG CTTTAGCTTT ATGCAACCTG AGGAATTTGA GCTTGACAGA
AAGCGCACTC CATTTCTTCC CGATAGGCGC CAACTTACCG CGCTTTGGCG ACGAGAGTTA
AAATATCAGT GGCTTACCCT AAAGCATAGT GGCGAAAAAA ACAGTTCTAT TCGTGCCGAG
CTTTCTAAAA GCTATGCAAG CCGTTTAAGC TTGTTGCAAC GCCAAACGCC AAACGATGCT
TTTCAAAGTT ACATGGCAGC CGTTACCACT TCGTTTGACC CGCATACTAG CTATTTATCG
CCCGACGACT ATACCAATTT TCAAATTGAT ATGAGCCGTT CGCTTGAAGG TATTGGTGCG
AAGCTCCAAA CCGAAGGGCA ATACACGGTA GTGGGTGAAA TTATTCCGGG TGGACCTGCC
TTTAAAACAG GTTTTGTTAA AAAGGGTGAT AAAATAATTG CCGTAGGGCA GGGAAGTAGT
GCGCCTATGG TGGATGTTAC GGGCTGGCGC ATTAACGATG TGGTCAAGCA AATTCGTGGA
CCAAAAAACA GCATAGTACG TTTAAAAATA TTGCCAGCAA GTCAAGGTGG AGTAGCTTCC
ACTAAGGTGG TGCAGTTAGT TCGCGAAAAA ATTGATTTGC AAGAACAAGC TGCCCGCAAA
AGCATTATTC AGCAAAATGG ATTGAAAATT GGCGTTATCA CCATTCCCTC ATTTTATCTT
GATTTTGAAG GGCAACAAAA GCAAGCCACC AACTATGCTA GCACAAGTCG CGATGTTGCC
CGCATTGTGG AGGAACTGCA ACGTGAGGAA TTAAGCGGCA TTATTCTTGA TTTGCGCGAT
AATGGTGGAG GCTCGCTTGA AGAGGCAGTG AACGTTACGG GGCTTTTTAT TACAAGCGGT
CCTGTGGTGC AGGTGAGCAA TGCTTCAGGC GGCAAAAGCG TTGTGCGCGA TGACGACCGC
CGCATTTTTT ACAGCGGTCC ACTTGCCGTG TTGGTGAATC GTTATAGCGC TTCAGCTTCT
GAAATTGTAG CGGCGGCTAT GCAAGATTAT AAACGAGGCA TTGTTATTGG TGAACGCACC
TTTGGTAAAG GCACCGTGCA AAGCATTGTT AAGCTTACAC GTCCCTTTCA CTTTTTTGGC
AAAGCGCCAG AGTTTGGTCA GCTTAAGCTT ACCGTAGCAA AATTTTACCG CATTTCAGGC
GGTAGTACCC AGCACAAAGG TGTAGTGCCC GATATTACCA TGCCGTCACT GATTGATACC
TCAAGCGTTG GTGAGGATAC TTATAGCAGC AGTTTGCCAT GGAGCACCAT TTCACCTGCC
CTATTCCGTC CTATTGCCGA TGTTACGCCC GAGCATGTTA CCCAGTTGCG CCAAAAGCAG
CAAGTGCGTA TTGATACCTC ACGTCTGTAC AAAACCTACA TGCGTGATCT TGCAACGCTT
AACCGCATTC GCAAGAAAAA AAGCATCACC TTACAAGACT CCTCCTTTAA GTCGGATGTA
GAAACGCTCC GCCAAATTGA AAAAAATTGG GGTGAAAGTA ATGAGCTGGA TTCAACGCAC
ACGAAAAGTG GTGGTAAAGC TTTAGAGCGC GATGTGTTGT TGCAACAATC CTCAGCGGTT
ATGGCGGATT TTGTGGAACT TAAAACTACC GAACGCCAAA CGGTTATTCG TGCGGTGCCC
GCGTTGAATT AA
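
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be normalized before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous A, C, G, and T characters.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence as displayed in the record
tail = clean_sequence("GCGTTGAATT AA")
print(len(tail))  # 12
print(tail[-3:])  # TAA
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.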
 
Protein sequence
MPPCRVEAFW PLATSRAASA TTLQPTALHD ETGKYISQTL LQYHYRKPAT NDSLSLQIFN 
RYLEQLDGSK SYFVASEVES LRKVYGTRFD DELLAGKSKS GFGMYNFFLK RAKEKMRFMK
ATADTARFSF MQPEEFELDR KRTPFLPDRR QLTALWRREL KYQWLTLKHS GEKNSSIRAE
LSKSYASRLS LLQRQTPNDA FQSYMAAVTT SFDPHTSYLS PDDYTNFQID MSRSLEGIGA
KLQTEGQYTV VGEIIPGGPA FKTGFVKKGD KIIAVGQGSS APMVDVTGWR INDVVKQIRG
PKNSIVRLKI LPASQGGVAS TKVVQLVREK IDLQEQAARK SIIQQNGLKI GVITIPSFYL
DFEGQQKQAT NYASTSRDVA RIVEELQREE LSGIILDLRD NGGGSLEEAV NVTGLFITSG
PVVQVSNASG GKSVVRDDDR RIFYSGPLAV LVNRYSASAS EIVAAAMQDY KRGIVIGERT
FGKGTVQSIV KLTRPFHFFG KAPEFGQLKL TVAKFYRISG GSTQHKGVVP DITMPSLIDT
SSVGEDTYSS SLPWSTISPA LFRPIADVTP EHVTQLRQKQ QVRIDTSRLY KTYMRDLATL
NRIRKKKSIT LQDSSFKSDV ETLRQIEKNW GESNELDSTH TKSGGKALER DVLLQQSSAV
MADFVELKTT ERQTVIRAVP ALN