Gene PCC8801_2251 details


Gene Information

Locus tag: PCC8801_2251
Symbol:
ID: 7102487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2321838
End bp: 2323442
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 38%
IMG OID: 643475300
Product: transcriptional regulator, XRE family
Protein accession: YP_002372429
Protein GI: 218247058
COG category:
COG ID:
TIGRFAM ID:
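
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2321838
end_bp = 2323442

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon, which is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1605 bp
print(protein_length, "aa")  # 534 aa
```

Both results match the Gene Length and Protein Length fields reported in the record.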


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTACA CAATTACTGA CAGTTGTCCA AACTGTACAA GCTGTCAAAT CGACTGTCCA 
ACCGATGCTA TTCAATTGCA CAATGGGGCA TATTCAATTG ATGAAAAACT CTGTAATAAC
TGTCAAGGTT ACTATGCAGA GCCCCAATGT ATTATCCAAT GCCCTATTAG TAGTCCGATT
CCTACCCACG CCAAGAAAGG CAGATACAAA GCAGTAGAAA GAGTTGCAAT TTTTGAAGAT
CTTTTTGTCA ATGGTAATAA TACTCCTTTT GCTTCATCAA TGGTCATTTG GGAAGCGTGT
AACCTATTAA CCAGTGCCAC AATTCTTCCT TGGGAAAAAG ATGTTGATGG AACCTTATAT
TACCAGCGAG AGGTTAAAAA AGGTCGAGGA AGTATCATTT TTCGGTTAAG TAATACACCA
GAATTAGCTA CCAATCAAAC CATTGACTAT GCCTCAGACT TTTCATCAAT AGAATCCCTA
GACATTCGCT CTTCTGTACT ACACTTAATT TATGCTGCCT ATGCGACAAC ATTAGATAAA
CCTTGGGAAC AAGAATTTGT TATCAATGAT CAACAAATAG AACATTATTT AGGCTTAGAT
AAACGCAAAG ATCTCAGTAA AGCCACTAAG CTATCTTTGA TCAAGAATTT AGTTCAGCAA
CCTTGTCAAT TAAGGGCTGC CATTGATTGG CCGCAACAAG GTAAAGTTAA AGGATTTTAT
GTTCCAGAAA GTCCTCTTTG GCATTTAGTT GATATTCAAC ATCATTTTCA GGAAGACTCC
CTAGGATGCA AACATCTGAT TGGATTAACA TTTACTGTAA AACCCGGACT GTGGGCTAAA
TTTTTCTTAA ATAAACAAGA CTATAGTCAT CGGATTGCCT TTTATCAATA TGGTTCCCTT
CCCAAGTTTT TGTTAAATAC CGTGATGAGT ATTTGGCAAC AACATCAAGG GGCAGTTCGC
ATTATGCTAT GGTTGCTTTT TAAAAGTAAA ATGGGTCGCA AACAATGCTT AACGGTTTCT
ACCTTAATGC GAGTGGCTTA TGGCCAAGAA AAAGTTAACC TAGCCAGCTT ACAACGCGAA
CAACGAAAGC GACTTATTCG GTCTTTTGAA AGTGATCTCG AAGTGTTAAA TCATTATGGA
CTGAAAGCCG TTTTTGATCC AATTTCCTAT CCTGAAACGA TTCAACCAAT GTGGGTTAAA
TTAGCACAAC TCCCTGATGA TGCTGATGAA GCCGTGGAAT TTTGGATTAA TGATGGTTCT
CAAGAACATC GCTTAACAGA TTCTGGTCCT AGGGGGAAAT GGAATCAGTT AATTAAAGCG
CGGATTCTTA CTTTTGAATT ACCCCCAGAA TGGGAAGAAC AATTAGCAAA ATTCGAGCGT
AAAAAACAAC AAATTACTAA TCGAAAAACC CGTTCTAAGA AAGTTGGTGA ACTAACTTCT
GATCAGATTT TAGCAGCGAG GCAACGTCAG GGAATGAGTC AGAGAGCCCT AGCCGAAAAA
TTAGGAAAAA GTCAAAGTTG GATTCGAGAT TTAGAACGAG GACGCTTCTC CGCAAAACCT
GAAGATCGAG CTATTCTGCA AACGGTTTTA GGATTGCAAT CTTGA
 
Protein sequence
MSYTITDSCP NCTSCQIDCP TDAIQLHNGA YSIDEKLCNN CQGYYAEPQC IIQCPISSPI 
PTHAKKGRYK AVERVAIFED LFVNGNNTPF ASSMVIWEAC NLLTSATILP WEKDVDGTLY
YQREVKKGRG SIIFRLSNTP ELATNQTIDY ASDFSSIESL DIRSSVLHLI YAAYATTLDK
PWEQEFVIND QQIEHYLGLD KRKDLSKATK LSLIKNLVQQ PCQLRAAIDW PQQGKVKGFY
VPESPLWHLV DIQHHFQEDS LGCKHLIGLT FTVKPGLWAK FFLNKQDYSH RIAFYQYGSL
PKFLLNTVMS IWQQHQGAVR IMLWLLFKSK MGRKQCLTVS TLMRVAYGQE KVNLASLQRE
QRKRLIRSFE SDLEVLNHYG LKAVFDPISY PETIQPMWVK LAQLPDDADE AVEFWINDGS
QEHRLTDSGP RGKWNQLIKA RILTFELPPE WEEQLAKFER KKQQITNRKT RSKKVGELTS
DQILAARQRQ GMSQRALAEK LGKSQSWIRD LERGRFSAKP EDRAILQTVL GLQS
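
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal sketch, not the method used to produce this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and it is demonstrated on the first line of the gene sequence only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGTCTTACA CAATTACTGA CAGTTGTCCA AACTGTACAA GCTGTCAAAT CGACTGTCCA"
print(translate(first_line))  # MSYTITDSCPNCTSCQIDCP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.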