Gene PCC8801_3003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3003 
Symbol 
ID7104494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3109994 
End bp3112984 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content48% 
IMG OID643476032 
ProductATPase, P-type (transporting), HAD superfamily, subfamily IC 
Protein accessionYP_002373146 
Protein GI218247775 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0474] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAC CTAATTCAAT TGATTGTCAT CAGCCAAGGG AAACATCTTT TGTCAAAATT 
CTCCACGGAA CTGTCAAAGG AAGAGGTCGA TACAAAGTTC AAGGATTATT CGGTTCAAAC
CCACTCAAAC GGTATTTAGA GTTCAATTTA TCCCTACGAA AAGGCATAAA AACCGTTCAA
GCTAACCCCA ATACAGGCAA TATTCTTCTA CACTTCCATA AAGACAAAAC AACTCAAGAA
ATAGCCATCC TCATTGATTC CCTGGTCCAA GATTATTATC GATCGCCTAA AGAGTTCGCC
CAACTTTTTT ATCAACAAAC AATGGTTATC CTTAACACCT CAGTTCCTTG GCATCTCAAA
GAGATTGATA CCATTGTTGC GGAACTCAAC ACCTCAACAG AAAGGGGACT TTCCCACGCC
GATGCCCAAA CTAACCTAAG ACAATATGGG AGTAATGCCC TCACTGAAGC TGAACCCCGT
TCTGGGTTGA GCATTTTTCT TGACTACTTC AAATCTGTTC CCGTTGCCCT GTTAAGCGGT
GCAGCCCTTC TTTCCGTCCT AACCGGAGGC ATCGCTGATG CGATCGTAAT CATGGGAGTT
GTCAGTATTA ATGCCATTTT AGGGTATGTC ACCGAAAGTA ACTCAGAACG GATCATTAAT
TCCCTCAAAC ACTTTATTAA CCCCTCTGCC TGGGTACTTC GAGAAGGCCA ATTAATTGAA
ATTAATAGCC AAGATCTGGC CGTCGGGGAT ATTTTACTCC TACAACCCGG TTCCTATGTT
CCGGCTGATG CCCGATTAAT CGAAGCTGAT CGCCTGAGTA TCGATGAATC TGCCCTAACC
GGGGAAAGTT TGCCCATCCG TAAACACCAA GAGATCCTAG CCTCTCCTCA AGAAACTATT
CCTCTAGCGG AACGGAAAAA TATGGTCTAT CGGGGTACAT TTGTCACGGG AGGCCAAGGC
CGAGGGGTGA TAGTCTCCAC GGGAAATTCA ACGGAAATGG GACAAATCCA GTCCTTAGTC
GGGGAAACTA GCCAACCGTC TACTCCCATG GAACGACAAC TCGAACAAGC CGGAAGCCAA
CTGGTTTTAC TGTCGAGTGT GGTTTGCGCT TTAGTCTTTG CCATCGGATT GTTACGAGGA
TACGGCTTGC TAGAAATGGT AAAAAGCTCC ATTTCTCTCG CGGTTGCTGC TGTTCCCGAA
GGATTACCCA CCGTAGCTAC TACTACCCTG GCCTTGGGTA TCCTGAATAT GCGTAAACAA
AAAGTTCTGA TTCGTCGGTT AGAAGCGATC GAAGCCCTCG GTTCTATTCA AGCCCTCTGC
TTGGATAAAA CAGGCACGCT AACGGCTAAT CGGATGACGG TATTAAAGGT GTGTTGGGAT
GGACGGGAGA CTAAGCTGGC AGATGGTCAT TTTTGGGTAG ACAATCAAGA AATCAATCCC
TATAGCTGCG ACGAATTATT AAAACTGATT CATATCGCCG TCCTGTGTAA CGACAGTCAG
ATTAATACTC ATCAAGACGG AACCTACATC ATTGACGGTT CTGCGACAGA AAATGCCTTG
ATAGAAATGG CGATCGCAGC CGGAGTCACC GTTGCCGACC TCAATCACAA ATATCCGCGC
CTTCTGACCT ACCACCGTTC CACCGAACAC AATTTCATGG CCACGGTGCA TCGCATCCAT
GAATCCGCCT ATCTCATGGC CGTCAAAGGC AACCCCTCAG AAGTCCTCGA TCGCTGTTCA
ACCCAGATGC GAAACGGTCA ACCCGTCGAG TTAACCGAAG CTGATCGACA AGCGATCGAA
GAACAAAATG AAAGCTTGGC CGGCCAAGCT TTACGGGTCT TAGGTATCGC TTACAGCCAA
GGGGAAACCG CCGATATCGA GTCCTTGCCC GTCTCTAATC TCATTTGGGT TGGGCTCATT
GGTATGGCTG ATCCCATTCG CCCCGGGGTT ACAGAAACCA TCGCCGATTT TCACACCGCC
GGGATCAACA CCCTGATGAT CACTGGGGAT CAAAGCCCCA CTGCCTACGC GATCGGCAAA
GAGTTGAATC TCAGCCAAGG ACAACCTCTA AAAATCCTCG ATTCTACCGA ATTAACCGAT
CTTTCTCCAG ACGTATTAGC CGGGTTGTCG GAACAGGTCC ATATTTTTGC CCGAATTAGC
CCTGCCCACA AACTTCAGAT TGTCCAGGCA CTCCAGCAAC GCGGCTTAGT TGTGGCCATG
ACGGGGGATG GCATTAATGA TACTCCGGCC TTGAAAGCGG CAGAAGTGGG CATTGCCATG
GGACATACGG GGACGGATGT CGCGCGAGAA GTCGCTGATG TTGTCCTAGA GGATGATAAC
CTGCAAACGA TGATTATTGC GGTGAGTCAG GGACGCACGA TTTACAACAA TATTCGTAAA
TCGGTTCATT TTCTGCTGTC AACCAACCTC AGCGAAATTA TTGTCATGTT ATTCGCTACC
ACTGGAGGAC TCGGACAACC CCTGAACGCG ATGCAGCTAC TGTGGCTCAA TTTAGTCACC
GATATCTTTC CGGGGTTAGC CTTAGCCCTA GAAGCCCCTG AACCGGACGT TTTAACCCTT
CCGCCGCGAT CGCCTGATGA ACCGATTATT AAATCCTCCG ATTTTCGACG GATTGTGTGG
GAATCGACGG CGTTATCGGT GAGTTCTTTA GCGGCCTATG GCTATGGTAT CGCTCGTTAT
GGGATCAGTC CCCATGCTAG TACCATCGCG TTTATGAGTT TGGTTAGCGG ACAACTCCTC
CATGCCCTCA GTTGTCGGTC ATCAAGACCT TTACGAAGCC AACAACTGCC CCCAAATCCC
TACTTAACGG GAGCCCTGGC TGGATCGATG GGGCTTCAGT GGGTATCCTT GGCTACCCCT
GGATTGAGAA ATCTCTTACA CCTGACTCCC CTTAATCTGG CTGATAGTTT GGTGATTGGA
GGTAGTGCTA TTTTGCCGTT GATTATCAAT GAAGGAACGA AACCTCAATA A
 
Protein sequence
MKRPNSIDCH QPRETSFVKI LHGTVKGRGR YKVQGLFGSN PLKRYLEFNL SLRKGIKTVQ 
ANPNTGNILL HFHKDKTTQE IAILIDSLVQ DYYRSPKEFA QLFYQQTMVI LNTSVPWHLK
EIDTIVAELN TSTERGLSHA DAQTNLRQYG SNALTEAEPR SGLSIFLDYF KSVPVALLSG
AALLSVLTGG IADAIVIMGV VSINAILGYV TESNSERIIN SLKHFINPSA WVLREGQLIE
INSQDLAVGD ILLLQPGSYV PADARLIEAD RLSIDESALT GESLPIRKHQ EILASPQETI
PLAERKNMVY RGTFVTGGQG RGVIVSTGNS TEMGQIQSLV GETSQPSTPM ERQLEQAGSQ
LVLLSSVVCA LVFAIGLLRG YGLLEMVKSS ISLAVAAVPE GLPTVATTTL ALGILNMRKQ
KVLIRRLEAI EALGSIQALC LDKTGTLTAN RMTVLKVCWD GRETKLADGH FWVDNQEINP
YSCDELLKLI HIAVLCNDSQ INTHQDGTYI IDGSATENAL IEMAIAAGVT VADLNHKYPR
LLTYHRSTEH NFMATVHRIH ESAYLMAVKG NPSEVLDRCS TQMRNGQPVE LTEADRQAIE
EQNESLAGQA LRVLGIAYSQ GETADIESLP VSNLIWVGLI GMADPIRPGV TETIADFHTA
GINTLMITGD QSPTAYAIGK ELNLSQGQPL KILDSTELTD LSPDVLAGLS EQVHIFARIS
PAHKLQIVQA LQQRGLVVAM TGDGINDTPA LKAAEVGIAM GHTGTDVARE VADVVLEDDN
LQTMIIAVSQ GRTIYNNIRK SVHFLLSTNL SEIIVMLFAT TGGLGQPLNA MQLLWLNLVT
DIFPGLALAL EAPEPDVLTL PPRSPDEPII KSSDFRRIVW ESTALSVSSL AAYGYGIARY
GISPHASTIA FMSLVSGQLL HALSCRSSRP LRSQQLPPNP YLTGALAGSM GLQWVSLATP
GLRNLLHLTP LNLADSLVIG GSAILPLIIN EGTKPQ