Gene PCC8801_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3472 
Symbol 
ID7101563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3627033 
End bp3628184 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content40% 
IMG OID643476484 
Producthypothetical protein 
Protein accessionYP_002373593 
Protein GI218248222 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCCG TTGTCGTATT TGTTACCCAT TTGATTTTAG TCATTTTTTT AACTGCTTGT 
GGGATATCTC CTCAAGTTCT GGCCGAACAG CGATTATTTC CCCCCGTTTC CCTAGAATTT
TTAGGAGAAT ACCAGCTACC GAAGCAAACC TTTGAGGGAA CCTCCGTTGG AGGACTATCG
GGATTAAGCT ATGATCGTCA ACGCGATCGC TTTTATGCTT TATCAGATGA TCGCTCCCAA
AAAGCACCCG CTCGATTTTA TAGCTTAAAA TTGTCAATCT CTGACGGAAA TGATGGCAAA
ACCGAGATTA ACAGCATCAC GGTTGAAGCA GTGACGTTTC TGAAAAATTC ATCAGGGGAG
TTTTATCAAG TTCAAACCAT TGATCCCGAA GGAATTGCTC TTTCTCCTAG GGATACAGTC
TTTATTAGTA GCGAAGGAGT TCCCAAACGG GGAATTAATC CTTTTATTGG AGAATTTAAT
CTCAAAACGG GTCAACTAGA ACGAACTTTA CCCCTACCTG AACGGTTTTT ACCGGGAAAG
GAACCCGATG GAACTCCTCG CGGGGTGGAG GATAATTTAG GGTTTGAGTC TTTAACCATT
AGTGCAACCA GTACCCTGAA AGATGATCCA TTTCGGTTAT TTACGGCAAA TGAATGGTCG
TTAAGTCAAG ATACGGCTCA AACTGACCAA AAACAAAAAC CCCTGCGATT ACTGCATTAT
GGCATTAATT CTATTGGATC TCCTGTACTG ATTGCTGAAC ATTTATATCT GTTGGATGAG
ACTCCTAACG GGGTAGTTTC TAATGGGTTA ACGGATTTAT TGGCCTTACC TCAAGAGGGA
TTTTGGTTAA GTTTAGAACG AACTTTTGGA TTATCAGGAA ATGGTGCAAA ATTGTTTGAA
CTGGTTAATA GTAATGCGTC AGATATTTCT ACTCGATTGA AGCTCACAGG GGACTTAAAA
GATATTAACC CATTACAAAA GAAGTTATTA TTAGATTTGA GCGACTTAGG GATTGAGTTA
GATAATTTAG AAGGGATGAC GTTTGGTCCT CGCTTATCCG ATGGGAGTCA GTCTTTAATT
TTAGTGAGTG ATGATAATTT TAATCAAACT CAGGTGACTC AGTTTCTTTT GTTTCGATTA
AAGCAAGAAT AA
 
Protein sequence
MKPVVVFVTH LILVIFLTAC GISPQVLAEQ RLFPPVSLEF LGEYQLPKQT FEGTSVGGLS 
GLSYDRQRDR FYALSDDRSQ KAPARFYSLK LSISDGNDGK TEINSITVEA VTFLKNSSGE
FYQVQTIDPE GIALSPRDTV FISSEGVPKR GINPFIGEFN LKTGQLERTL PLPERFLPGK
EPDGTPRGVE DNLGFESLTI SATSTLKDDP FRLFTANEWS LSQDTAQTDQ KQKPLRLLHY
GINSIGSPVL IAEHLYLLDE TPNGVVSNGL TDLLALPQEG FWLSLERTFG LSGNGAKLFE
LVNSNASDIS TRLKLTGDLK DINPLQKKLL LDLSDLGIEL DNLEGMTFGP RLSDGSQSLI
LVSDDNFNQT QVTQFLLFRL KQE