Gene PCC8801_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0472 
Symbol 
ID7105012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp483020 
End bp484129 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content38% 
IMG OID643473581 
ProductAAA ATPase 
Protein accessionYP_002370724 
Protein GI218245353 
COG category[R] General function prediction only 
COG ID[COG3950] Predicted ATP-binding protein involved in virulence 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTAG AATCCATTAC AATTCAAAAT TTTAAGCGAT TTGATAATAT TGAGGTTTCT 
TTTAAGAACA AAACCCTGCA AGAGGTTACT AATCGCTTCC TAATTCTTGG AGACAATGGG
ACAGGTAAAA CAACGCTTCT CCAGGCAATT GCTCTCCCAC TAGCTCTAGC TACAAAAAAA
ATTCAAACAG TATCTGAATT TGACTGGGTA GGTTTTTTGC CAGGTCGATT TTGGATTGGA
GGTTCACCCC ATATTGAACT AGAAATATCA TTTGAAGATG AGGAACTTGA AGCAACAAAG
TCGGTAGCTA GAAGATGGTA CGAGAAGCAA CCAGTTGAAT TTCGTCCTCC TGATTTTGTT
GAACCTGGTA ATAGTCATTT AGTTAAGTTA ACTCTCAATG GAGAATATTG GAAGGTTGGA
GAAGATAATA AACTTGAAGA ACGCTCTCAA TTTCAAGGTC GTTACTATGC TCAAAGATTG
ATGAGAAGTG ACCCTTCTGT GCGCTCTGAA TTTTCTAGAC TTCCTGGTAT TTTTTGGTTC
GATCAGTTTC GAAATCTTGG TTCAAATCCA CTGACTGAAA GTAGTGGAGA TGGACAAACA
GATCATACGG CTGGCATTTC GTTTGATCTA GGTGTAGGAC GTTTACGTCA GTATTTAATT
CAGTGGGATC AAAAAAGAAG AACAGGGCAA AATAATACTT CTATTGACTA TCTCAAAGAA
TTACAGATTT ATTATACAAA GGTTTTTCCT GAACGTTCAT TCAGTGGAGT TGAATATCAA
CCCAGTAATG ATTCGCCAAC GGAAATGAAT ACATATTTTA CCCTATATGA CGGGCATCGA
ACTTATGATA TTGTTGAGAT GTCAGCAGGA GAACAAGCAG TTTTTCCGAT GCTCTATGAG
ATTGTTAGAC AGCAAATTTC ATACTCAATT GTTTTAGTCG ATGAAATTGA TTTAAACCTT
CATCCTCCAG CAGCTCAGCT CTTGGTTAAT CAACTTCCCA AGATTGCTCC TACTTGTCAA
TTCCTATTCA CAACTCATTC TGAGGCTGTA AATGATGTAA TTGGCGAAGA GGAAACTTAT
CGATTGCCAG GAGGGTCTTT GTGCCTGTAA
 
Protein sequence
MKVESITIQN FKRFDNIEVS FKNKTLQEVT NRFLILGDNG TGKTTLLQAI ALPLALATKK 
IQTVSEFDWV GFLPGRFWIG GSPHIELEIS FEDEELEATK SVARRWYEKQ PVEFRPPDFV
EPGNSHLVKL TLNGEYWKVG EDNKLEERSQ FQGRYYAQRL MRSDPSVRSE FSRLPGIFWF
DQFRNLGSNP LTESSGDGQT DHTAGISFDL GVGRLRQYLI QWDQKRRTGQ NNTSIDYLKE
LQIYYTKVFP ERSFSGVEYQ PSNDSPTEMN TYFTLYDGHR TYDIVEMSAG EQAVFPMLYE
IVRQQISYSI VLVDEIDLNL HPPAAQLLVN QLPKIAPTCQ FLFTTHSEAV NDVIGEEETY
RLPGGSLCL