Gene Cag_0423 details

Gene Information

Locus tag: Cag_0423
Symbol:
ID: 3747689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 497006
End bp: 498565
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 47%
IMG OID: 637772953
Product: exopolyphosphatase, putative
Protein accession: YP_378739
Protein GI: 78188401
COG category: [F] Nucleotide transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0248] Exopolyphosphatase
TIGRFAM ID:
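
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 497006
END_BP = 498565


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1560 519"
```

Both results agree with the Gene Length (1560 bp) and Protein Length (519 aa) fields listed in the record.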


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAG AAGCATTGCG CGTAGCGGCT ATTGATCTTG GTACCAACTC TTTCCACATG 
ATTATTGTGG AGGGAAGTCG CGATAAAGGT ATTGTTGAAA TTGACCGCGT GAAGGATATG
ATTGGCATTG GGCATGGCAG TATTGCCACT AAAATGCTGA CGGAGGAGGC AATGCAAGCG
GGCATAGCAA CGCTAAAGAA GTTTTTAGTG TTAGCTGCCC AGCATGGCGT GCAATTTGAG
CATATTCTTG CTTTTGCTAC CAGCGCCATT CGCGAAGCAA AAAATCGTCT TGATTTTATT
AATCGTGTTC GGGCTGAAAC GGGCTTAAAA GTCAAGATTA TTTCGGGTAA AGAGGAGGCG
GAGTTTATTT ACTACGGCGT GCGCAATGCG GTAAGTGTTG GCAAAACGGC GGATTTGATT
TTTGATATTG GTGGCGGTTC GGTGGAGTTT GTGTTAGTGA ATCACAAAGG GGTGCAACTG
CTTGAAAGCC GTAAAATTGG TGTGGCGCGT ATGCACGAGC GCTTTGTTTC AAGCGATCCC
ATAGCAGCAA ACGATGTTAA AATGCTTGAA CAATTTTTTG CGGCTGAAAT GGTTTCGGCG
GTGGATAAAG CTACGACAAT GAAAGTGCGT CGTGCGGTTG CGTCATCAGG CACAGCCGAG
ACCATTGCCC GCATGATTCA CGCCATGCAA GGGCGCGATA GCGATGGCGC GTTAAACAAT
AGTTGCTTTA CGCGCAGTGA GTTTCAGCAA CTCTACCACA CCGTGTTGCT CATGAATTCA
GCAGAGCGCA AAAAAATGAG CGGCTTGGAT GAAAAGCGGG TTGATTTAAT TGTGCCAGGG
CTCATTTTAG TGGATATGAT TTTTAAGCTC TTTCGGCTTG AAGAAATTGT TATTGCCGAT
TCGGCTTTGC GTGAAGGCAT GGTGCTGCAC TACTTGCAGC AGCAAGGTTC GGTGCTTAAA
AAACGAGGTC ATCAAGAATC GCTTGATATT CGGCGCGAAA GCGTTAATGA GCTGGGATTC
CGTTGCCATT GGGATCGTGG GCATTCGGAA TACATTGCTC GCCTCTGCCT TCAGCTTTTC
GATAAACTTG CTCCCCTCCA TCAGCTTGAA GAAAATTATC GTGAATTGCT GGAATATTCT
GCTCTTCTGC ACAACATTGG AGCCTTTATT TCAATCTCCT CCCATCATAA GCATAGCCAA
TACATTGTCA TGAATGGCGA GTTGCGCGGC TTTTCTCCCT CCGAAATTGC CATTCTTGGG
CATGTAGTGC GCTACCATCG TAAGTCGCCC CCTTCCGAAA AACATACGCC CTATAATGCC
TTAAAGCTGC CGCACAAACG GGCGGTTGAT GTGCTTTCGG GCATTTTGCG CATTGCCAAC
GGCTTGGAAC GTGGACATCG CCAAAACGTG CAAAATGTTG ATGTGCAAGT AAAAGGCAAA
AGCATTACCA TGGCGCTAAC CTGCTGCTTT GAACCCGATA TTGAAATATG GGCAGCCGAT
CAACTCAAGG CGTGGCTTGA AACGGTGCTA CAAAAAACCA TCCATTTTCA ACGCGCGTAA
 
Protein sequence
MNKEALRVAA IDLGTNSFHM IIVEGSRDKG IVEIDRVKDM IGIGHGSIAT KMLTEEAMQA 
GIATLKKFLV LAAQHGVQFE HILAFATSAI REAKNRLDFI NRVRAETGLK VKIISGKEEA
EFIYYGVRNA VSVGKTADLI FDIGGGSVEF VLVNHKGVQL LESRKIGVAR MHERFVSSDP
IAANDVKMLE QFFAAEMVSA VDKATTMKVR RAVASSGTAE TIARMIHAMQ GRDSDGALNN
SCFTRSEFQQ LYHTVLLMNS AERKKMSGLD EKRVDLIVPG LILVDMIFKL FRLEEIVIAD
SALREGMVLH YLQQQGSVLK KRGHQESLDI RRESVNELGF RCHWDRGHSE YIARLCLQLF
DKLAPLHQLE ENYRELLEYS ALLHNIGAFI SISSHHKHSQ YIVMNGELRG FSPSEIAILG
HVVRYHRKSP PSEKHTPYNA LKLPHKRAVD VLSGILRIAN GLERGHRQNV QNVDVQVKGK
SITMALTCCF EPDIEIWAAD QLKAWLETVL QKTIHFQRA
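
The sequences in this record are printed in blocks of ten characters separated by spaces, so they need to be joined before use. The following is a minimal sketch, assuming the sequence text has been copied out of the record as a plain string; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First display line of the gene sequence from this record.
    first_line = (
        "ATGAATAAAG AAGCATTGCG CGTAGCGGCT "
        "ATTGATCTTG GTACCAACTC TTTCCACATG"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_fraction(seq), 2))
```

Running the same helpers on the full gene sequence gives a value that can be compared with the GC content field of the record.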