Gene PCC8801_4300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4300 
Symbol 
ID7102662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4519656 
End bp4520753 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content38% 
IMG OID643477280 
Productmetallophosphoesterase 
Protein accessionYP_002374379 
Protein GI218249008 
COG category[R] General function prediction only 
COG ID[COG1409] Predicted phosphohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATA ATCGTCGTCA ATTTATAATT TTTCTTTGTT GTCTTTTAGG GGTTGTTATG 
GCTACGGTTA GCCATCAAGT ATTTAGCCGA AATAATATTA CGTCTGAACC GCCATCTAAC
GTCGTTGAAA ATCCAATAGA AACTCCTATT AATGAACCCG TAGCGGCTGC CCCTTCAGGA
TTATTTGCCC CGGTTAAAGG GGATGTTAGA ATTGTTGTGA TTAGTGATTT GAATAGTCAG
TATGGTTCAA CCAGTTATGA ACCGGAAGTT AAAGAAGCGA TCGCCCTAAC TCCCCAATGG
AAACCAGACT TAGTATTATG TGGGGGAGAT ATGATTGCCG GACAAAAAAG ATCCCTAACT
CAACAACAAA TTCAAGCCAT GTGGTCGGCG TTTGATGCTA ACATTAGTAA GCCCTTACGT
CAAGCGAAGA TTCCCTTCGG GTTTACCATT GGTAATCATG ATGGATCAGG GGCAATCAGT
CAAGGAAAAT TAATTTTTAA ATCAGAAAGA GACTTAGCTT CAACGTATTG GAATCAACCC
CAAAATAATC CAGGGTTAAA CTTTGTTGAT CGGGGAAATT TTCCGTTTTA TTATAGTTTT
ATACAAAAAG ATATTTACTA TTTAGTGTGG GATGCCTCTA CTCATATTAT TTCATCTGAA
CAATTAGCTT GGGTAGAAAA AAATTTAGCC AGTCCTGTTG CTCAAAATGC CAAATTACGC
CTAGTGATTG GACATCTTCC CCTCTATCCA GTTGCGGTAG GACGTAATGA CGGAGGGAAC
TTTTTAAGTA ATGCTGAAAA ACTACAAGCC TTATTAGAAC GCTATCAAGT TCATACCTAT
ATTAGTGGAC ATCATCATGC CTATTATCCC GGTAAAAAAG ATAACTTAGA ATTACTTCAT
GCGGGGGCAT TAGGAGGGGG ACCCAGAAAG TTATTAAATA GTAATCTTTC TCCTCGCAAA
ACCATAACAG TCGTTGATAT TAATTTAACG TCTCAGTCAA CCACTTACAC GACTTATGAC
ATGAAAACCA AACAGGTTAT TGATATTAAA ACCTTACCTC AGTCTATTGG CAAAGTATGG
CGAAGAGATC TTAAATAA
 
Protein sequence
MNYNRRQFII FLCCLLGVVM ATVSHQVFSR NNITSEPPSN VVENPIETPI NEPVAAAPSG 
LFAPVKGDVR IVVISDLNSQ YGSTSYEPEV KEAIALTPQW KPDLVLCGGD MIAGQKRSLT
QQQIQAMWSA FDANISKPLR QAKIPFGFTI GNHDGSGAIS QGKLIFKSER DLASTYWNQP
QNNPGLNFVD RGNFPFYYSF IQKDIYYLVW DASTHIISSE QLAWVEKNLA SPVAQNAKLR
LVIGHLPLYP VAVGRNDGGN FLSNAEKLQA LLERYQVHTY ISGHHHAYYP GKKDNLELLH
AGALGGGPRK LLNSNLSPRK TITVVDINLT SQSTTYTTYD MKTKQVIDIK TLPQSIGKVW
RRDLK