Gene Cag_0824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0824 
Symbol 
ID3746823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1150406 
End bp1151470 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content46% 
IMG OID637773354 
ProductDHH family protein 
Protein accessionYP_379133 
Protein GI78188795 
COG category[R] General function prediction only 
COG ID[COG0618] Exopolyphosphatase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.534288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTATTC CCTCCTACGG TCGCACCCTT CACGCTGAAG AGTGGCAACC GCTCCTTGAG 
CCGCTGCTTG CAGCTCAACA CCTTGTTTTA ACAACGCACG AAAATTCTGA TGGCGATGGC
TTAGGGTGCG AAGTTGCCCT TGCTCTTGCT CTTACGGCTC TTGGCAAAGA GGTTTCCATT
GTGAACCCAA CGGAAGTACC GCCCAACTAC CAATTTTTGA GGCAACTCTA CCCAATAGTT
CAATTTAATC CCAAAAGTGA AGAGGCAATT CAAGAGCTTT CGCTGTGCGA TGCCGTGGTG
CTGCTTGATG CCAATTTAAG CGACCGCATG GGAACCTTGT GGCCTCACGT TCGTTTTGCA
CGCGAGCTTG GTAGTTTAAA GCTTCTCTGC GTTGATCACC ATCTTGAACC AAATGATTTT
ACCGATGTTA TGATTTCGGA GTCGTATGCC TCCTCCACTG GCGAGTTAGT ATATGGCTTA
ATTCTTGCTA TGGAACAAAG TGTTGGGCGT GCGCTCTTTA CACCCAATAT TGCTCAAGCG
CTCTATGTGG CGGTAATGAC GGATACGGGT TCATTCCGAT TTTCAAAAAC AACTCCATAC
GTTTATCAAT TAGCGGGCGA TTTAGTGGCG CGTGGGGCTA ATCCCGAAAA AGCATACGAT
TTAATTTTTA ATTCGCTAAC GCCTCAAGCG CTCAAATTAC TTGGCTTGTC GTTAAGCGCT
ATTTCTCTTG TTGAGGGGGG AAAACTTTCG TGGCTGCTTA TTTCACAAGA GATGTTAAAA
GCAACGGAAA GTAAGTTGTT TGATACTGAT ATTATTGTCC GTTATCTTTT AAGTGTGCCC
TCAGTTGCCA TAGCGGTACT TTTAGTTGAA ATGCAAGATG GACGTACCAA AGCAAGTTTT
CGCTCGCGTG GCAAGTTGCC CGTTAATAAA CTTGCTAAAG AATTTGGCGG CGGTGGGCAT
ATGAATGCGG CTGGTGCGCT TTTTCCCTAT ACGCCCGAAA AGGTACAACA AGTGCTTCCG
CAAGCTGTGC GTCGCTTTAT AAAAGAGCAT GAAGCGCTGC TGTAA
 
Protein sequence
MIIPSYGRTL HAEEWQPLLE PLLAAQHLVL TTHENSDGDG LGCEVALALA LTALGKEVSI 
VNPTEVPPNY QFLRQLYPIV QFNPKSEEAI QELSLCDAVV LLDANLSDRM GTLWPHVRFA
RELGSLKLLC VDHHLEPNDF TDVMISESYA SSTGELVYGL ILAMEQSVGR ALFTPNIAQA
LYVAVMTDTG SFRFSKTTPY VYQLAGDLVA RGANPEKAYD LIFNSLTPQA LKLLGLSLSA
ISLVEGGKLS WLLISQEMLK ATESKLFDTD IIVRYLLSVP SVAIAVLLVE MQDGRTKASF
RSRGKLPVNK LAKEFGGGGH MNAAGALFPY TPEKVQQVLP QAVRRFIKEH EALL