Gene Cag_1768 details


Gene Information

Locus tag: Cag_1768
Symbol:
ID: 3746628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2285395
End bp: 2286873
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 50%
IMG OID: 637774305
Product: TPR repeat-containing protein
Protein accession: YP_380062
Protein GI: 78189724
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
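
The start, end, gene length, and protein length fields of this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2285395, 2286873)
print(length, protein_length(length))  # prints: 1479 492
```

The result agrees with the Gene Length (1479 bp) and Protein Length (492 aa) fields.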


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGATGT TGGCGGGATG CTCTTCCAGC TCGTCAACAG TTTCCACCCA AAAAATCCAA 
GCACCTCTTC CCAAACCACT GCCCGAAACC GTTGCGTATG AGCTGGCTAC GGCATCGCTT
TTAATGGCGC AAGGTGAGTA TCAGCAAGCG CTTGAGCGAT ATCGAGCATT GCTTACCACA
GAGTCCAACA ATGCAGCCCT GCACCACGCC TTAGCAAAAG CCTACACCGC AAATGGAGAG
TTTGTGGCAG CACGCCAACA TAGCCAACAA AGCGTTACGT TAGAAGGCAC CAATGTGTGG
TATTTGCGAT TGCTTATTGC ACTAACGCAC AATGAAAGCG ATTATGCGCA AGCGGTTGCA
TTAAGCAAAA AGTTGGTGAC TTTGGAACCC GATAACCGCG AAGCGCTTAC CATGTTAGCC
TATGAGCACT TAGCGGCACG TCAGCCCAAC GAAGCGCTGG AGGTATTTCA ACGCTTATTG
CAGCTTGATC CCGCAAATGC TGAAGTATTG CTGAGTAGCG CCGAAGTAGC GCTTGAACTT
GGTCGCCGTA GCGATGCCCT CCGCTTCTTT AATCAACTCC TTCACTATGG TATTGAAAGT
GATTCCATCC ACTTTTTTAT AGGCGATTTA CAACAGCAGC AAGGGTTACA CGAAGCCGCA
CTTGCAAGCT ACCGCAACGC CCTCAAGCTC AATCCGCACC TTTTGCCCGC ATGGTATCGC
CGCCTTGAAC TTGTAGCACT TTCTCCCAAC CTTTCCCAAT CCTCAAAACC AACACTTTTT
GCCGAAGAGC TTCAGCATTT CTATAAGCAA AGCGGCACAA CATTGGAGCA ACAATTGGGG
CTTCTCCAGC TCTTTACAAA TCGAGCAACT CGCAACCCAG CCTTCATAAG CGCAACCCAA
AGCATGATAA AAGCGCTACA ACAGCGCTAT TCATCTCACT CGCTTGTACG TTTTACCGTG
CAAATTGCGC AAGGGCGATT GTTTGTGGCG CAAGGCCAGC ACGCCCAAGC CATTACACTG
CTACGCCAAG CTCTCCGCTC ACCCCATGCT ACACGCCAAC CTAATGTAGC GCTTGATGCC
GAGAGTACCC TTGCCCTTGC TTACGAGCGT TCTGGTAAAG TGACGGAGAG CATTCGTCTC
TACGAAAAGA TGTTACGCCG CACGCCCAAC AACGCCCTGC TTGCCAACAA TCTTGCCTAC
TTGCTTGCCA CACAACATCG AGAGTTGCCA CGCGCTCTTG AGCTTGCCAA AAAAGCTGTT
GCGGCGGAAC CAAATAATCC CATTTATCTT GATACGCTTG GTTGGGTACA TTTTGCCATG
CAGCAATACG AACCTGCCCG TGAGCTACTT GAAAAAGCGC TGCAAGGTGA GCCGAATGAG
CCAGAAGTGA TTGAGCACCT TATTGCGGTA TATGAAAAGC TTGGGAACCA AAGCAAAGTG
CAGGAGTTGC AGGAGCGGTT GCGGAGGGTT TGTTTATAA
 
Protein sequence
MLMLAGCSSS SSTVSTQKIQ APLPKPLPET VAYELATASL LMAQGEYQQA LERYRALLTT 
ESNNAALHHA LAKAYTANGE FVAARQHSQQ SVTLEGTNVW YLRLLIALTH NESDYAQAVA
LSKKLVTLEP DNREALTMLA YEHLAARQPN EALEVFQRLL QLDPANAEVL LSSAEVALEL
GRRSDALRFF NQLLHYGIES DSIHFFIGDL QQQQGLHEAA LASYRNALKL NPHLLPAWYR
RLELVALSPN LSQSSKPTLF AEELQHFYKQ SGTTLEQQLG LLQLFTNRAT RNPAFISATQ
SMIKALQQRY SSHSLVRFTV QIAQGRLFVA QGQHAQAITL LRQALRSPHA TRQPNVALDA
ESTLALAYER SGKVTESIRL YEKMLRRTPN NALLANNLAY LLATQHRELP RALELAKKAV
AAEPNNPIYL DTLGWVHFAM QQYEPARELL EKALQGEPNE PEVIEHLIAV YEKLGNQSKV
QELQERLRRV CL
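
The sequences in this record are displayed in space-separated blocks of ten characters. A minimal Python sketch for collapsing such a listing into one contiguous string and computing its GC fraction follows. The helper names are hypothetical, and the input shown is only the final line of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Final line of the gene sequence listing, used here as a short example.
last_line = "CAGGAGTTGC AGGAGCGGTT GCGGAGGGTT TGTTTATAA"
seq = clean_sequence(last_line)
print(len(seq))  # prints: 39
```

Applied to the full gene sequence listing, the cleaned length should match the Gene Length field (1479 bp), and `gc_fraction` can be compared against the GC content field.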