Gene PCC8801_2272 details

Gene Information

Locus tag: PCC8801_2272
Symbol:
ID: 7105109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2343277
End bp: 2344413
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 44%
IMG OID: 643475318
Product: GUN4 domain protein
Protein accession: YP_002372447
Protein GI: 218247076
COG category:
COG ID:
TIGRFAM ID:
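
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2343277
end_bp = 2344413
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets. Assumption: the last codon is a stop codon
# and is therefore not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)                             # True
print(remainder == 0 and codons - 1 == protein_length_aa)    # True
```

Under these assumptions, 2344413 - 2343277 + 1 = 1137 bp, and 1137 / 3 = 379 codons, which gives 378 amino acids plus one stop codon.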


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAATC AAACATCAAC TCCGTTGCTT TTTATTTCTT ACCGTCGAGA CGATAGTGCT 
GATGTAACGG GGAGAATTTA TGATCGTTTA ATTCAATATT TTGGGAAAGA CACGATTTTT
AAAGATGTGG ACTCGATTCC CATCGGCGTT GATTTCCGTC AGTATATCGA TCAAGAAGTG
GGGCGATGTC AAATCTTATT AGCGATTATT GGTCAACAAT GGCTCAATAT TACTGATACC
ACGGGAAAAC GTCGCCTAGA CGATCCCCAA GATTTTGTTA GACTCGAAAT TGAATCCGCC
CTGAAGCGCA ATATTCCCGT GGTTCCCGTT CTGGTTAGGG GAGCAAAGGT TCCTACTGAA
CAAGAATTAC CCCCCAGTTT AAGGGAATTG GCTTACCGGA ATGGGAGTTT AGTGCGATCT
GATCCCGATT TTCACGGAGA TCTCGATCGC TTAATTCTGG GGATTGAGCG CCATCTTGAA
GAACATCAAG CCAAATCGCC TCAACCCTCC TTAAAGACTT CCTTTCCCTT CAAATTCAAG
TCCTGGTGGT TGCTAGGAGG ATTAGGGGGG GCGATCGCTC TTATCCTGGG TATTGGCTCG
CTTTTGTCCC AAGTTTCGAT CTTTGTTGAC ATTCAACCCC TTCAATACAA ACAACTGGAA
AAATTTTTAA ACGAGCAAAA TTGGCAAGCG GCTGATCGAG AAACGGCAAA AATCATGTTA
GCAGCAACGG GAAGAGAACA AGAAAAATGG ATCGATAAAA AGGGGATCAA TCAGATGTCT
TGCCAAGAGA TTCGCAAGAT CGACGATCTT TGGCTCAAAG CGAGTCAAGG AAAGTTTGGG
TTTAGTACAC AGCGAGAAAT CTGGAGAAAA GTCGCTAATA ACGATAAATT TGGCGATCTA
ATAGGCTGGC GACAGAATAA TCAATGGCTA ACGACCGATC AATTACAGTT TAATTTAAGT
GCACCGAAGG GGCATTTACC GTCGAGTTCC CGTGAAGGCA AATTATCAGG GGGATGGTTA
GTCTGGTATT TATTACCGAT GACGACGACG GGCAATCAAT CAGATTCTAA GGCGAGTCAG
TGTTGGCCAG AGGAAAAAGC AGTTAGTTTC TCCGATTCTG CTTCTGCATT TTCTTGA
 
Protein sequence
MKNQTSTPLL FISYRRDDSA DVTGRIYDRL IQYFGKDTIF KDVDSIPIGV DFRQYIDQEV 
GRCQILLAII GQQWLNITDT TGKRRLDDPQ DFVRLEIESA LKRNIPVVPV LVRGAKVPTE
QELPPSLREL AYRNGSLVRS DPDFHGDLDR LILGIERHLE EHQAKSPQPS LKTSFPFKFK
SWWLLGGLGG AIALILGIGS LLSQVSIFVD IQPLQYKQLE KFLNEQNWQA ADRETAKIML
AATGREQEKW IDKKGINQMS CQEIRKIDDL WLKASQGKFG FSTQREIWRK VANNDKFGDL
IGWRQNNQWL TTDQLQFNLS APKGHLPSSS REGKLSGGWL VWYLLPMTTT GNQSDSKASQ
CWPEEKAVSF SDSASAFS
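
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not part of the original record. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the set of permitted start codons), and it stops at the first in-frame stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The inputs used below are the first 30 and the last 9 bases of the gene sequence listed above.

```python
# Codon table in NCBI ordering (bases T, C, A, G at each position).
# Assumption: standard amino-acid assignments, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence (first three groups of ten).
print(translate("ATGAAGAATC AAACATCAAC TCCGTTGCTT"))  # MKNQTSTPLL

# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("TTTTCTTGA"))  # FS
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and the GC content field to be compared against values computed directly from the bases.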