Gene Cyan8802_3993 details


Gene Information

Locus tag: Cyan8802_3993
Symbol:
ID: 8393343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4108392
End bp: 4110677
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 44%
IMG OID: 644981917
Product: PEP-utilising protein mobile region
Protein accession: YP_003139631
Protein GI: 257061743
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase
TIGRFAM ID:
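
The coordinate, length and GC fields in this record are related by simple arithmetic, which the sketch below illustrates. The function names are hypothetical, and the code assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace in the input is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)
```

With the values listed above, gene_length(4108392, 4110677) gives 2286, and protein_length(2286) gives 761, matching the Gene Length and Protein Length fields.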


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.800573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.608484
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGATAA CTCTGTATTG GCTTTGTGAT ATTGAAACCA GCGATCGCCT GTGGGTAGGC 
GAGACCGCTT GGATGCTTAG CCGACTGCAA CGGGAGGGCT ATCCGATTGA TGGAGGATTG
GTGGTGAGTG GCCAAGTTTG GCGAGAATTT CTCAAGAGGT TTGACAATTC AACCTCATTA
CTAGCAGATT TTCCCGCATC TTCCCTCCAT TTAGATATCG ATAATCCTCG TTCCTTACAA
TTAGTCGCTC AACAAAGTCG TCAAGCTATC GGACAAATTC CCTTTTCTCA GGAATGGTTA
TCTGAGTTAA AAAAAGCGAT TGAAGGATTA GATACACCGT GTTTGAGAGT CCAAAGTTCC
CTGGCTACAC CATTGACCTT TAAAAAACCG TTTTCTAGCT TAATTGAGCC ACAAATCTGT
GATAATTCTC CAGAAAGCTT AGAATTAGCC ATTAAACAAA TCTGGGGGCA ATTATTTAGT
GCTAAAAGCC TATTTTATTG GCAGCGTCTG GGATTAGGCA TTGAGCAATT AAACCTAGCA
GTCTTAATCC AACCCTTAAC GAATGCGATC GCCTCAGGAA CCGCAACCCT CACCCCCAAT
TCCTTTGAAA TTGAAGCCAC CTATGGGTTA GGAGAGAGTT TGCGATGGGG GGAAGTCTTA
CCCGATCGCT TTACCCTAGA TCCCCGAACA GGAAGCCTTA TTCACCAAGA ACTAGGGCAT
AAAATTCGGG CTTATCGACT TAAGGATAAC ATTAAAGAAA ATGCGCTAGA AGCTTATGTT
CTCAGTCCCG GGGAACAAGA GCAATATTGC CTCAATCAAT CAACTTTATC TGCACTATTC
CCTGTACTCA AGCGTCTTAG TCAAGATAAC CTTTCTGGGA TTTCTCTAGA ATGGATCTTG
AGAAAATCCC ATAACTCTCA ACTACCCCCT CTAGCGATCG CCCAGGTAAT ACCCGCTATG
TCCGTTCCCC TAACTCCTCC CTCGGCTTCC CCTGAACCCA AGACAAGCAG GAATCATCAG
CCTCTACTCA AGGGAATTGC CGCCGCTCCT GGTCGCATCC ATGCTTTAAC TCAGGTGATC
GAAGAAGGGA TGCTTCCCGA TCCTTCCTTA GCTAAACAAA ACATCATCGT TACCAGAGAA
ATCAACCCGT CTCAACTATC CTGGCTCAAA AATGCGGCGG GGGTGATCAC AGAACTCGGC
GGCATGACCA GTCATGCCGC CATTTTAGCG CGAGAATTAG AAATTCCTGC GGTGGTAGGA
GTCTCAGGAG CAACGAATTT ACTCAGAACC GGAGACTCCA TTATTATTGA CGGTCAAAAA
GGAGAAATTT ATCGCTATCT TGGGCAAGAA GAGGCAGTCT ATCCCTATCA ATCTGTCGCT
CCTAAGCCGA TAGTGACCAG TGGATTTCAT CCCATTGCTA CAAAATTGAT GGTTAATTTG
AGTCAACCGA GTTCTCTGAT CAAGGCGATG GATTTACCCG TTGATGGGGT TGGATTATTG
CGATCGGAGT TAATGCTGTT GGATCTGTCT TCTCCTAATT CCGTGGAACA ATGGTTAGAG
CAAACACCAC CATTTGAGAT TATAGAACAG TTAAGACAAT CTATTCAAGA ATTTACGGCT
CGTTTTGCCC CTCGTCCTGT ATTTTACCGT TCTTTTGATG GACAATCTTG CCAAGGAGTT
AACCCACCTT CGTCAAAAGA CTGTCGGGGA ACGGCGGGTT ATCAAATCGA TCCAACCTTG
TTTGATTGGG AATTACAAGC TTTAGTTCAA GTTCAGCAGC AGGGATACCA TAATCTCCAT
TTAATTTTAC CGTTTGTGCG TAGTGTCGGG GAATTTCGCT TTTGTCGTCG TCGGGTAGAA
CAGGCCGGGT TACTCGAATC TGATGGGTTT CAATTATGGA TTATGGCTGA AGTTCCTTCG
GTGATTTTTG AGCTTAGGGA TTATGTGAAA GCAGGGGTTC AAGGGGTGGC AATTGGAACT
AATGATCTGG TTCCTTTTTT GTTAGGAATT AACCGAGATC ATCCTCAAAT TAATGATAAT
TCTAAACCCT GTCCAACGGC GTTATCAAAT GCTTTAAAAC AGTTAATTGA ACAATGCCGT
GAGTTAGGGA TTCCCTGTTG TTTGTGTGGA CAAGTGGCGA TACAATATCC CTATCTTATT
GATCAGTTGA TTGACTGGGG AATTACGTCT ATTTCTGTAG AACCAGAAGC CATAGAACGA
ACCTATCAAG CGATCGCTAG GGCAGAACAA CGGTTATTAT TAGAAATAAA ACGTCTTAAA
GGATAA
 
Protein sequence
MVITLYWLCD IETSDRLWVG ETAWMLSRLQ REGYPIDGGL VVSGQVWREF LKRFDNSTSL 
LADFPASSLH LDIDNPRSLQ LVAQQSRQAI GQIPFSQEWL SELKKAIEGL DTPCLRVQSS
LATPLTFKKP FSSLIEPQIC DNSPESLELA IKQIWGQLFS AKSLFYWQRL GLGIEQLNLA
VLIQPLTNAI ASGTATLTPN SFEIEATYGL GESLRWGEVL PDRFTLDPRT GSLIHQELGH
KIRAYRLKDN IKENALEAYV LSPGEQEQYC LNQSTLSALF PVLKRLSQDN LSGISLEWIL
RKSHNSQLPP LAIAQVIPAM SVPLTPPSAS PEPKTSRNHQ PLLKGIAAAP GRIHALTQVI
EEGMLPDPSL AKQNIIVTRE INPSQLSWLK NAAGVITELG GMTSHAAILA RELEIPAVVG
VSGATNLLRT GDSIIIDGQK GEIYRYLGQE EAVYPYQSVA PKPIVTSGFH PIATKLMVNL
SQPSSLIKAM DLPVDGVGLL RSELMLLDLS SPNSVEQWLE QTPPFEIIEQ LRQSIQEFTA
RFAPRPVFYR SFDGQSCQGV NPPSSKDCRG TAGYQIDPTL FDWELQALVQ VQQQGYHNLH
LILPFVRSVG EFRFCRRRVE QAGLLESDGF QLWIMAEVPS VIFELRDYVK AGVQGVAIGT
NDLVPFLLGI NRDHPQINDN SKPCPTALSN ALKQLIEQCR ELGIPCCLCG QVAIQYPYLI
DQLIDWGITS ISVEPEAIER TYQAIARAEQ RLLLEIKRLK G
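
The protein sequence above can be reproduced from the gene sequence under translation table 11. The following is a minimal sketch, not the pipeline that generated this record: the function name translate_table11 is hypothetical, and the start-codon set is deliberately reduced to ATG, GTG and TTG, which is a simplification of the initiators that table 11 allows.

```python
# Amino acids for all 64 codons in TCAG order; table 11 shares the standard
# codon assignments and differs mainly in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a reduced set of alternative start codons, each read as Met.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon always encodes Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)
```

For example, the first 30 bases of the gene sequence, GTGGTGATAA CTCTGTATTG GCTTTGTGAT, translate to MVITLYWLCD, the first block of the protein sequence. The leading GTG is read as M because it is the start codon, while the second GTG is read as V.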