Gene Cag_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0903 
Symbol 
ID3748094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1237181 
End bp1238410 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content51% 
IMG OID637773435 
Productpeptidase M20D, amidohydrolase 
Protein accessionYP_379211 
Protein GI78188873 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAAG AAGAGAGCCA TCACCCCATA GCCGAAGCCA TTCAGCACAA AGCTGCCGAA 
TTATTTCCTG AAGTTGTAGC CCTACGCCGC GACATTCATG CCCATCCCGA ACTCTCGCTG
CAAGAGCACC GCACCACAGC GCTTATTACC AGCTACCTTA TGCAGCTTGG CATTACGCCC
GAAAAACCCC TGCTCGACAC GGGCGTTATT GCACTTATTC GAGGTACGTC GCCCCACCAC
CACGGCAAAG TGATAGCATT GCGTGCCGAT ATTGATGCGC TTCCTCTCCA AGAAGAAAAC
TCGACGGACT ATTGCTCAAT TGAAGCAGGC AAAATGCACG CTTGCGGGCA CGACATGCAC
ACCGCCATGC TTCTTGGCGC TGCAAAAATT CTTTCGGGCA TGAAAGAGCA ACTTGCTGGC
GATGTTCTCT TAATTTTTCA ACCATCCGAA GAAAAAGCAC CTGGTGGTGC TCGTCCACTG
CTTGATGCAG GACTTTTTGC CACCTATAAG CCCATTCTCA TTTTGGGACA ACACTGCTTT
CCCACCATAG AGTGCGGCAG CGTAGCATTT TGCCGAGGTG CTTTTATGGC GGCAGCCGAT
GAACTCTATA TTACGGTTAA CGGCAAAGGT GGGCACGCCT CAGCCCCGCA CAAAGCCGCC
GATCCCGTGT TAGCCGCCGC TCACATGGTA ACCGCCGTGC AACAGCTTGT AAGCCGTGTA
GTGCCACCCC ACGAAGCCGC CGTTGTTACC ATTTCAGCCA TTAATGGCGG TCATGCAACC
AACGTAATTC CACGCCAAGT AACCATGATG GGCACTATGC GTAGCATGAA CGAAGAGGTA
CGCGCTATTT TGCAAGAACG GTTACAGCAA GCCATTACCC ACACTGCACA AGCCTTTGGT
GTAGAAGCTG AGCTTACTAT TGTAAAAGGC TACCCCGTGC TTTACAACAA CCAAACCATT
ACCGACCAAG CCTCCTGCAT TTGCGCCGAA TATCTCGGTC ATCATCAAGT GCAGCATTGC
CAACCCTTGA TGACCGCCGA AGACTTTGCA TATTATTTGC AAGAGTGCCC CGGCACATTT
TGGCAAATTG GCACAGGTGT GCGCGAAGGC GAAACCGCAA ATACCCTCCA CTCCCCCACC
TTTAACCCCA ACGAAGAGGC TCTTCAAGTT GGTACAGGGT TGCTTGCATA CAACGCTTAT
CGTTTTCTTG CATCACTACA TGGGGAGTAA
 
Protein sequence
MKQEESHHPI AEAIQHKAAE LFPEVVALRR DIHAHPELSL QEHRTTALIT SYLMQLGITP 
EKPLLDTGVI ALIRGTSPHH HGKVIALRAD IDALPLQEEN STDYCSIEAG KMHACGHDMH
TAMLLGAAKI LSGMKEQLAG DVLLIFQPSE EKAPGGARPL LDAGLFATYK PILILGQHCF
PTIECGSVAF CRGAFMAAAD ELYITVNGKG GHASAPHKAA DPVLAAAHMV TAVQQLVSRV
VPPHEAAVVT ISAINGGHAT NVIPRQVTMM GTMRSMNEEV RAILQERLQQ AITHTAQAFG
VEAELTIVKG YPVLYNNQTI TDQASCICAE YLGHHQVQHC QPLMTAEDFA YYLQECPGTF
WQIGTGVREG ETANTLHSPT FNPNEEALQV GTGLLAYNAY RFLASLHGE