Gene Cag_0303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0303 
Symbol 
ID3748101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp336183 
End bp339134 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content46% 
IMG OID637772833 
ProductM16 family peptidase 
Protein accessionYP_378622 
Protein GI78188284 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAC CGCGTAAATT TTTTCCTGTG CTGCTTTTTG CAGCTATGGT ATTATTTTTT 
TTGCACTTAA CTGCTTGTTC ACCCACCAAA ACTCTTATGA ACTCAAATAG CGCTTACCCC
TACACTACCA TTCAAGGAGA TTCTCTCCAC ACTCGTATTT ATAAATTAAA AAATGGCTTA
ACGGTTTTTA TGAGCCCTTG CTACGATGAG CCGCGCATTT ACACCTCCAT TGCCGTGCGT
GCTGGTAGCA AAAACGATCC AGCCGAAACC ACAGGCTTGG CGCACTATCT TGAACACATG
CTCTTTAAAG GCACCGATGC CATTGGTTCG CTCGATTACC ACAAAGAGCA TCCCCAGCTT
GAAAAAATTA CTGCGCTGTA CGAAGAGTAT CGCTCTACTG CCAATCCCGA AAAGCGGGCA
GCCATTTATA AAATGATTGA TTCGCTTTCA AACGTGGCGG CAAGCTACAC CGTGCCTAAC
GAGTACGACA AGCTTCTTAG CTCGCTTGGC GCTACGGGCA CCAATGCTTA CACATGGGTT
GAGCAAACCG TCTATATTAA CGACATTCCC TCCAACAAAC TTGACCAATG GCTCACTATT
GAAGCCGAAC GCTTCCGCAA TCCCGTTATG CGCTTGTTCC ACACGGAGCT TGAAACTGTG
TATGAAGAAA AAAACATGAC GATGGATAGC GATAGCCGCA AAATTTGGGA AAACCTCTTT
GCGCAGCTCT TTCAAAAGCA TCAATACGGT ACGCAAACTA CCATTGGCAA GGCTGAGCAC
TTAAAAAATC CCTCCATTAA AAATGTAATG GAGTATTACC GCTCGCACTA TGTGCCGAAC
AACATGGCGC TGTGCATTGC GGGCGATTTT GATCCCGATG CTACCATTCG TTTAATTGAT
GAAAAATTTT CGGTGTTGGA ATCGCAACCA CTTGCACGCT TTACGGTTGA AGCTGAAGAG
GAAATTACTG CGCCACGGGT AATGCACGTA AAAGGTCCAG AGTCGGAGGA GTTGGTGATG
GGTTATCGCT TTAAAGGGGT GAATAGCAGC GATGCCGATT ATTTAACGCT GATTGATAAA
ATTTTATTTA ACCACACGGC TGGCTTAATT GATTTGAATC TTAACCAGCA GCAAAAGGTG
CTTGATGCAA GCTCTATGTT GGTGCTCATG AAGGATTATT CGGCTCACTT ACTTACGGGC
AAACCTCGCG AAGGGCAGAG CCTTGAAGAG GTGCAGCAGT TGTTGATGGA GCAAATTGAA
TTGCTGAAGC AAGGCGAATT TCCTGAATGG TTGCTTGAAG CGGCAATTAA CGACCTTTAC
ACCGAGCAGC TTAAGCAGTA CGAAACCAAC CGTGGGCGGG TGGAAGCCTA TGTAGATTCC
TTTATTTGGG GTATGGAGTG GCAAGCCTAT ATGCAGCAAA TTGAGCGTCT CCACAAAATT
ACCAAAGCCG ATATTGTTGC ATTCGCCCGC AAGCATTATA GCACCAACAA CTACGTTGCA
GTCTTTAAGG AGCATGGCAC GCCCGAAAGC GAGGCAAAAA TTCAAAAGCC GCCAATTACA
CCGCTTACGG TAAATCGCGA TACCATTTCA ACCTTTGCGC AAAATCTTCT CGAACGCCCA
TCAGCGTTAA CGCAACCACG CTTTCTTGAT TACAGCAAGG ATATCAGCTT TTACAACGTA
ACCGATGATA TTACGTTGCA TTACGTTCAC AATAATGAGA ATGACCTCTT TTCGCTCTTT
TATGTTTTTG ATATAGGAAA AAATCATAGC AAAAAAATTG ATTTAGCGCT CGATTACCTC
TCCTATCTTG GCACGTCAAA ATTATCGCCA AAAGCGTATA GCCAAGAGAT GTATAAAATT
GGTGCTTCAT TCTCAGCTTA TACGGCTGAT AACTATGTTT ATCTCAAACT GTCGGGATTG
CATAAAAATG CGGAGGCGGC AATTCGCTTG CTTGAAGAGC TGTTGATGGA TGCTCAGCCT
GACGAAGAGG CGCTTGGCAA ACTGAAAGCA GGCACATTGA AAGAGCGTGC TGACGATAAG
CTTTCTAAAA AGAAAATTTT GTTTGAAGCA ATGGCGAACT ATGGCAAATA TGGTGCCCAT
TCGCCCTTTA CCAACGTGCT GAGTAACCGC GAAGTGGAGC AAGTGCGCTC GCAAGAGCTG
CTTGATGAAC TGCGCAACTT GCTGAACTAC CGCCACCGCG TGCTTTACTA CGGACCAGAA
AGCGCCGAAA ACGTGCTGAG CGAATTGCGG AGCGTGCGCC ACTATCCCGC TACATTTATG
GCAACTCCCT CGCTTGACCT CTTTAAGCCG CTTGAGGTAA CGGAGAATTT GGTCTATGTG
GTTGATTATG ATATGACGCA AGCCGAAGTA ATGATGCTCA TGAAGGATGA AACCTACAAT
TCCGCCACGC TGCCCATTGT TACGCTTTTT AACGAATATT ATGGCGGTGG CATGTCGTCC
GTTGTCTTTC AAGAGTTGCG CGAAGCCAAA GCGCTGGCTT ACTCCGTCTT TTCAGTCTAT
CGCACGCCAA AGCAAAAAGG TGAGCACAAC TACATTATTA GCTACATTGG TACGCAAGCC
GATAAGTTGC CCGAAGCGCT TGAAGGCATT GGCGATTTAA TGAAAACGCT CCCTGAATCG
CCCCAACTTT TTGAAACAGC TCAAAAAGGA ATTGAGCAGA AAATTGCCAC CGAGCGCCTT
ATTAAAACTG AGATTCTCTT TAATTATGAG GAAGCGCTTC GCCTTGGGCA CTCGCACGAT
GTGCGTAAGG ATATTTACGA TGCCACGCAG CGCATGAGTT TGGAGGATGT GAAAGCGTTC
CACAAAAAGC ACTTCAGCAA CAAAAAACAG GTAATGCTGG TGCTTGGTAA CCGCAAAAAC
CTCGATATGG CAACCTTGCG TAAGTACGGC ACCGTGCGCG AGCTAACGCT TAAAGAGATT
TTTGGCTACT AA
 
Protein sequence
MAKPRKFFPV LLFAAMVLFF LHLTACSPTK TLMNSNSAYP YTTIQGDSLH TRIYKLKNGL 
TVFMSPCYDE PRIYTSIAVR AGSKNDPAET TGLAHYLEHM LFKGTDAIGS LDYHKEHPQL
EKITALYEEY RSTANPEKRA AIYKMIDSLS NVAASYTVPN EYDKLLSSLG ATGTNAYTWV
EQTVYINDIP SNKLDQWLTI EAERFRNPVM RLFHTELETV YEEKNMTMDS DSRKIWENLF
AQLFQKHQYG TQTTIGKAEH LKNPSIKNVM EYYRSHYVPN NMALCIAGDF DPDATIRLID
EKFSVLESQP LARFTVEAEE EITAPRVMHV KGPESEELVM GYRFKGVNSS DADYLTLIDK
ILFNHTAGLI DLNLNQQQKV LDASSMLVLM KDYSAHLLTG KPREGQSLEE VQQLLMEQIE
LLKQGEFPEW LLEAAINDLY TEQLKQYETN RGRVEAYVDS FIWGMEWQAY MQQIERLHKI
TKADIVAFAR KHYSTNNYVA VFKEHGTPES EAKIQKPPIT PLTVNRDTIS TFAQNLLERP
SALTQPRFLD YSKDISFYNV TDDITLHYVH NNENDLFSLF YVFDIGKNHS KKIDLALDYL
SYLGTSKLSP KAYSQEMYKI GASFSAYTAD NYVYLKLSGL HKNAEAAIRL LEELLMDAQP
DEEALGKLKA GTLKERADDK LSKKKILFEA MANYGKYGAH SPFTNVLSNR EVEQVRSQEL
LDELRNLLNY RHRVLYYGPE SAENVLSELR SVRHYPATFM ATPSLDLFKP LEVTENLVYV
VDYDMTQAEV MMLMKDETYN SATLPIVTLF NEYYGGGMSS VVFQELREAK ALAYSVFSVY
RTPKQKGEHN YIISYIGTQA DKLPEALEGI GDLMKTLPES PQLFETAQKG IEQKIATERL
IKTEILFNYE EALRLGHSHD VRKDIYDATQ RMSLEDVKAF HKKHFSNKKQ VMLVLGNRKN
LDMATLRKYG TVRELTLKEI FGY