Gene Cagg_3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3471 
Symbol 
ID7269696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4229468 
End bp4231570 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content54% 
IMG OID643568279 
ProductPolyphosphate kinase 
Protein accessionYP_002464747 
Protein GI219850314 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0855] Polyphosphate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.448785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGTAC CTGAATCACC AATACTCGAT ACGCCACCGC TCGCCGAGTC GCGTTACTTT 
AATCGCGAAT TGAGCCTGAT CGAGTTTAAT CGTCGCGTAC TCGAAGAGGC GATGGATCCG
CGAAACCCGT TGCTCGAACG GGTGAAGTTT CTTGCGATCT TTGCCTCTAA TCTCGACGAA
TTTTTCATGA TCCGGGTGAG CGGTATCAAA CAACAAATCC GAGCCGGTGT GCAAAAACGT
TCACCTGACG GTCAGACACC GAGTGAACAG TTAAGTGCTA TCCGCCGTGC CCTGCTTCCA
TTGCTCGATC AAGAGCGCGA TTTGCTCCTC AACGAGCTAC TCCCTGCTCT TCGCGAGCAG
GGGATTAGTA TTCTCAATAC AGTCGCCCTT AACGCAGCAC AACGAGCATG GGTCGCCGAT
TATTTTCGTC GTCAGATCTT TCCGGTACTT ACACCACTTG CGTTCGATTC GAGCCGACCG
TTTCCCTTCA TCTCAAACCT GAGTCTCAAC CTCGCCGTAG TCATTCGCGA TCAGGCGAAG
GGGGAACTTT TTGCCCGCAT TAAAGTTCCC GAAGTCTTGC CGCGCCTGAT TCCACTCCCG
CAAGAACTTT GCCCGCCGGT CGGTGAATTG CCGCCGAGCC GTTGCCACTG CTTCGTCTGG
ATCGAACAGG TGATCGCCGA TCATCTCGAA CAGCTCTTCC CCGGCATGAA TGTGGTTGAG
GTTTACCCCT TCCGTGTAAC ACGCAATGCC GATGTGGAGA TCGAAGAGGA TGAAGCCGAC
GATTTGCTGG CTACCATCGA ACAAGGATTG CGTCAACGCC GGTTTGGCGA GGTGGTGCGC
CTCGCTATCG ATAGCGGCAT GCCTGAACGG ATCTGCCAAC TGTTAGCCGC CAATCTGAAA
GTTAGCTCCG ATGACATCTA TACCGTGCGT GGCCCGCTCG GCCTCAGCGA TCTGATGCAA
CTAACCAATC TTGATCGGCC TGATCTGAAA GACCCGCCCT ACGTGCCGCG TGTACCGGCT
ATCCTTAAGA ATAGTGGGAC GATCTTCGAG GCGATTAAGA AAAATGATAT CTTATTACAC
CACCCATACC ACTCGTTTAG CCCGGTGATC GATTTTATTC AAGCCGCCGC CGAAGATCCG
AATGTGTTGG CAATTAAGCA GACGCTCTAC CGTGTCGGGC GTAACTCGCC AATCGTACAG
GCATTGATGC ATGCCCGCGA GCACGGCAAA CAGGTGACGG TGGTCGTCGA GTTAAAAGCT
CGGTTCGATG AAGAAAACAA TATTACGTGG GCCCGCGCAA TGGAACGAGC CGGTGTCCAC
GTGGTGTATG GCCTGGTCGG TCTGAAGGTG CATGCCAAAC TCGCACTGGT CGTTCGGCAA
GAGAGTGACG GTATTCAGTG TTATGTCCAC CTTGGTACCG GTAACTACAA CGCGGTGACT
GCACGTGTCT ATACCGATCT CGGATTACTG ACATGCCGAC CCGAGATTGC TGCCGATGTG
GTCGATCTGT TCAATTATCT CACCGCGTAC AGCCGGCAAA AAGAGTATCG CAGCTTACTG
GTTGCACCGG TTAATCTGCG TCACCGCATG ATCGAGTTGA TCGAGGAAGA GATCGCACTT
CACCGCTTGT ACGGTAATGG GCGACTCATC TTCAAAATGA ACGCATTGGT CGACCGCAAG
ATGATTGATG CGCTCTACGC TGCCTCACAA GCCGGGGTAC AGATCGATCT GATCGTGCGA
GGGATGTGTT CATTGCGCCC ACAAGTTCCC GGCCTCTCCG ACAATATTCG GGTGCGCTCG
ATTGTCGGTC GCTATCTCGA ACATAGCCGG ATCTACTACT TCTCCCACGG TGGTAAGCCT
AAAGTGTATA TCGGGAGCGC CGATATGATG GAACGAAACC TCGACCGGCG CGTTGAAGAA
CTCTTCCCCC TCTCCGACCC AATCGCAATC CAATACGTCA CCGAGCGGCT ACTTCCTACC
TATCTGGCCG ATAACTTACG CGCCCGTGAA CTCCAACCTG ATGGTCGTTA TGTGCGCGTC
CATCCCGACG GTCACGAGGT TATCGACAGC CAAGACCCTG CCCGCATTAT TCCCGGCTGT
TAA
 
Protein sequence
MTVPESPILD TPPLAESRYF NRELSLIEFN RRVLEEAMDP RNPLLERVKF LAIFASNLDE 
FFMIRVSGIK QQIRAGVQKR SPDGQTPSEQ LSAIRRALLP LLDQERDLLL NELLPALREQ
GISILNTVAL NAAQRAWVAD YFRRQIFPVL TPLAFDSSRP FPFISNLSLN LAVVIRDQAK
GELFARIKVP EVLPRLIPLP QELCPPVGEL PPSRCHCFVW IEQVIADHLE QLFPGMNVVE
VYPFRVTRNA DVEIEEDEAD DLLATIEQGL RQRRFGEVVR LAIDSGMPER ICQLLAANLK
VSSDDIYTVR GPLGLSDLMQ LTNLDRPDLK DPPYVPRVPA ILKNSGTIFE AIKKNDILLH
HPYHSFSPVI DFIQAAAEDP NVLAIKQTLY RVGRNSPIVQ ALMHAREHGK QVTVVVELKA
RFDEENNITW ARAMERAGVH VVYGLVGLKV HAKLALVVRQ ESDGIQCYVH LGTGNYNAVT
ARVYTDLGLL TCRPEIAADV VDLFNYLTAY SRQKEYRSLL VAPVNLRHRM IELIEEEIAL
HRLYGNGRLI FKMNALVDRK MIDALYAASQ AGVQIDLIVR GMCSLRPQVP GLSDNIRVRS
IVGRYLEHSR IYYFSHGGKP KVYIGSADMM ERNLDRRVEE LFPLSDPIAI QYVTERLLPT
YLADNLRARE LQPDGRYVRV HPDGHEVIDS QDPARIIPGC