Gene Cag_0494 details

Gene Information

Locus tag: Cag_0494
Symbol:
ID: 3746363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 577652
End bp: 579172
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 48%
IMG OID: 637773028
Product: peptidase S1C, Do
Protein accession: YP_378810
Protein GI: 78188472
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
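
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(577652, 579172)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the listed Gene Length (1521 bp) and Protein Length (506 aa).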


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA GTGAGAAAAT ATCTTCACGC ATAAAAAAAG TGCTGTTGGT GTTAAGCGGC 
GTTGCGGTTG GTGCGCTTGT TTTTTCCAAC ATGGAGTACT CAGTTTCTTT TAACGGTACA
ACCTTTTCTA ACACTCCCTC TTTTGCCACA GCAACCAGCA ATATTGCTGA TGCTCCCATT
AGTTCACTAC GGAACTTTAA TGAGGCGTTT GTGCAAATTG CCGAATCGGC AACGCCTTCG
GTAGTAACTA TTTTTACCGA GAAAACGGTC AATCAGCGGG TTGTTTCGCC CTTTAACTTT
TTTGGAAGCC CTTTTGATGA CTTTTTTGGT CGTCCTGATG GGAATAGTGC CGAGCGTAAG
AATGTGCGGC GTGGCATTGG TTCAGGCGTT ATTGTAACGG CTGACGGCTA CATTCTTACC
AACAACCATG TGATTGATGG TGCCGATGTG GTTTATGTGC GCACGGCTGA TAAGCGCCGC
CTTGATGCTA AGGTGATTGG TACTGATCCC AAAACCGATA TTGCCGTTAT TAAGGTAAAT
CAGCAAGGGT TAAAGCCTAT TGTAATTGGC GATAGCGATA AGTTGCGAGT AGGGGAGTGG
GTAATTGCTA TTGGCAGTCC ACTTGGCGAA AATCTTGCAC GCACCGTAAC GCAAGGTATT
GTAAGCGCGA AAGGGCGTGC CAATGTAGGG TTAGCCGATT ATGAAGATTT TATTCAAACC
GATGCCGCCA TTAATCCGGG CAATTCAGGT GGTGCGCTGG TTAATATCAA TGGGGAATTA
GTTGGCATTA ACACGGCAAT TGCCAGCCGC ACGGGTGGCT TTGAGGGGAT TGGTTTTGCG
GTGCCATCCA ACATGGCAAA AAGCGTTTTA ACGGCGCTTA TTACCACAGG AAAAGTAACG
CGCTCCTACC TTGGCGTAAG CATTCAAGAT ATTGATGATA ACATTGCAAA AGCAATGAAT
GTAAAGGCGG GCGAAGGTGC TTTAGTGGGC ACGGTTATGG AGAATAGCCC TGCCGCACGA
GCTGGTATGC AAACAGGTGA TGTTATTTTG GAATTTAATG GCGCAAAAGT AACCAGCAGC
GCCGCCTTGC GTAATGCCAT TGCTACGCAA ACGCCCGGCA GCATGGTCTA TATTAGAGTG
TTACGCGATG GAGCGCTGAA GTCGTTTGCG GCACGCCTTG AAGAGCAAAC CCCAAAAACC
GCAAGTAGCA CAACTCCCGC TAAAAAAGCC GACATTAATA GTGCGCTTGG CTTTCGTGCC
GAAGAGCTGA CACCCGAATT GGCGCAGCGC TTAAAGCTGA AAGGGAGCAG CGGCAAAGTG
GTGATTACCG CAATTCAGCA ACAATCAACC GCCTATCGTG CAGGCTTGCG TCCGGGCGAT
GTGATTCTTT CGGTTAACAA GCAAGCGGTA AGTTCGGTAG CAAGCTATAA CGCATTGGTT
AAAAATCTTG CAAAAGGCGA ATTGCTGTTG CTCTTGATTG AGCGCGGGGG GAATAAGAGC
TACATTGCCT TTACGCTGTA A
 
Protein sequence
MKKSEKISSR IKKVLLVLSG VAVGALVFSN MEYSVSFNGT TFSNTPSFAT ATSNIADAPI 
SSLRNFNEAF VQIAESATPS VVTIFTEKTV NQRVVSPFNF FGSPFDDFFG RPDGNSAERK
NVRRGIGSGV IVTADGYILT NNHVIDGADV VYVRTADKRR LDAKVIGTDP KTDIAVIKVN
QQGLKPIVIG DSDKLRVGEW VIAIGSPLGE NLARTVTQGI VSAKGRANVG LADYEDFIQT
DAAINPGNSG GALVNINGEL VGINTAIASR TGGFEGIGFA VPSNMAKSVL TALITTGKVT
RSYLGVSIQD IDDNIAKAMN VKAGEGALVG TVMENSPAAR AGMQTGDVIL EFNGAKVTSS
AALRNAIATQ TPGSMVYIRV LRDGALKSFA ARLEEQTPKT ASSTTPAKKA DINSALGFRA
EELTPELAQR LKLKGSSGKV VITAIQQQST AYRAGLRPGD VILSVNKQAV SSVASYNALV
KNLAKGELLL LLIERGGNKS YIAFTL
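
Both sequences above are displayed in space-separated blocks of ten characters, which most tools will not accept as-is. The following is a minimal sketch of removing that spacing, using only the final line of each sequence as sample input; `join_blocks` is a hypothetical helper name, not part of the source database.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a blocked sequence."""
    return "".join(text.split())


# Final lines of the gene and protein sequences, copied from the record above.
last_gene_line = "TACATTGCCT TTACGCTGTA A"
last_protein_line = "KNLAKGELLL LLIERGGNKS YIAFTL"

gene_tail = join_blocks(last_gene_line)
protein_tail = join_blocks(last_protein_line)

print(gene_tail)       # contiguous nucleotides
print(protein_tail)    # contiguous residues
print(gene_tail[-3:])  # final codon of the CDS
```

The same function can be applied to the full multi-line sequence text, since `str.split()` with no arguments also splits on line breaks.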