Gene PCC8801_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3663 
Symbol 
ID7102916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3824473 
End bp3826275 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content45% 
IMG OID643476678 
Producthypothetical protein 
Protein accessionYP_002373781 
Protein GI218248410 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTG TTATTGGTAT TGTTATTGGT TTAGTCATAG GAGCCGTCGT TGCTTACCTT 
GTAGCTCATT CTGCTGGTGA ACGTAAGCTA AAAAGCCAAG AAAGCCAACT AGAGAGAGCC
AAAAAAGCCA TAGAAGAATT ACAAAGCGAT AATAGACAAC AAGCAGCACA AATTCAGCAA
TTACAACAGG GATCATCAAG TCAAGAGATA GAGCAGGCCT ATCAAGGGAA AATTCAAGAA
CTTGAAGAAC TCTATCAAGG AAAAGTGGCA GAATTAGAAG AAATACAAGC CCAAATTCAA
GCCACAGAGC AATCCTATCA TGCTCGCATC CAAGACACCG AAAAATCCTA TCAAGCCCAA
CTGCAAGAGA TCGAACGATC TTATCAAAGC CAAATCGAAC AACTACAACA AACTCATAGT
TCAGCCGTTG TCTTAGAAGC CAGTCGCAGC CAAATGCGAG AAATGGGACA CGATTATCAA
ACCCAAATAG AGCAACTACA ACAGGCCCAT CAACAAGCGA TCGCCGACCT AGAAACCGCC
CATCAAACTC AACTTCAAGC CATTGAACAA GCCCATCAAA CCGAAATTAA GGAGATTGAA
TCAACCTATC AACAGCAAAT TCAAGCCTTA CAACAACCCC AACAGCCAGA TATAACTGCC
AAAGAGTCCA TGATAGCAGC CGCTGGCATT GCTGGTGTGG CTGGTATTGC CGCCGGAGTG
GCAGCCCTAA CCCATGAGCA AAAAGAAGAA ACAGCCCAAG TCGCTGCACC CGAACTCGAA
GAATTATCCG CCGTCGCTGA CCTCGAAACC GAAGGACTAT TAGGTGGCTT AAGCAGCGAG
GAAACCGATA ATTTAACCCT AGACAACTTC CTAGAAGAAG AAACAGCCCA AGTCGCTGCA
CCCGAACTCG AAGAATTATC CGCCGTCGCT GACCTCGAAA CCGAAGGCCT ATTAGGCAGC
TTAAGCAGCG AGGAAACCGA TAATTTAACC CTAGACAACT TCCTAGAAGA AAAAACAGCC
CAAGTCGCTG CACTCGAACT CGAAGAATTA TCCGCCGTCG CTGACTTTGA AACCGAAGAC
CTATCAAGCG GCTTAAGCAA TTTAACCCTA GACAATTTCC TAGAAGAAGA AACAGCCCAA
GTCGCTGCAC CCGAACTCGA AGAATTGTCC GCCGTCGCTG ACCTCGAAAC CGAAGACCTA
TTAGGCGGCT TAAGCAGCGA GGAAACCGAT AATTTAACCC TAGACAACTT CCTAGAAGAA
GAAACAGCCC AAGTCGCTAC ACCCGAACTC GAAGAATTGT CCTTTATTGG TGAACAAATA
ACTCCAGACC TATTAGGGGA ATTAAATCTT GAGACATTAC AAGAAACTGC CAATATTTCA
ACGGCTGACA GGGCATTACT AGAGATGCTT CAAAATGACG AAGATATGGA CTCTCCCTCC
ACCGAGTTTA ACCTTCAAGA AATAGATGAC CTCTCTTCTG GGTTGTTACC CGAGCTAGAC
TCTCGCACTG ATGTAGACTT ATTGGAAATG CTACCCACCG AAGCCGAAAA AACAACGAAC
CCAGACAAAG ATCCCTTTGC CGATTTCTTT GAACCTGACT CCGAAACTGT AACAGCCGAC
CCCAATGATC CTTTTGCTGC CATCCTAGGC ATAGATTCTG AACATTCCAG TGAAAATTTA
ATGGATTTGT TATCCGATGA TCATCCAGAC GACTCTCACA TCAGTCAATC CCTTGCTCAA
GGCACTGAAG ACGAGTGGGA CAGTCTATTT GATGAAATTG ATGACAAGAC TCCCGCTAAA
TAG
 
Protein sequence
MDIVIGIVIG LVIGAVVAYL VAHSAGERKL KSQESQLERA KKAIEELQSD NRQQAAQIQQ 
LQQGSSSQEI EQAYQGKIQE LEELYQGKVA ELEEIQAQIQ ATEQSYHARI QDTEKSYQAQ
LQEIERSYQS QIEQLQQTHS SAVVLEASRS QMREMGHDYQ TQIEQLQQAH QQAIADLETA
HQTQLQAIEQ AHQTEIKEIE STYQQQIQAL QQPQQPDITA KESMIAAAGI AGVAGIAAGV
AALTHEQKEE TAQVAAPELE ELSAVADLET EGLLGGLSSE ETDNLTLDNF LEEETAQVAA
PELEELSAVA DLETEGLLGS LSSEETDNLT LDNFLEEKTA QVAALELEEL SAVADFETED
LSSGLSNLTL DNFLEEETAQ VAAPELEELS AVADLETEDL LGGLSSEETD NLTLDNFLEE
ETAQVATPEL EELSFIGEQI TPDLLGELNL ETLQETANIS TADRALLEML QNDEDMDSPS
TEFNLQEIDD LSSGLLPELD SRTDVDLLEM LPTEAEKTTN PDKDPFADFF EPDSETVTAD
PNDPFAAILG IDSEHSSENL MDLLSDDHPD DSHISQSLAQ GTEDEWDSLF DEIDDKTPAK