Gene Cyan8802_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2303 
Symbol 
ID8391623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2317861 
End bp2319465 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content38% 
IMG OID644980273 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003138015 
Protein GI257060127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.832541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTACA CAATTACTGA CAGTTGTCCA AACTGTACAA GCTGTCAAAT CGACTGTCCA 
ACCGATGCTA TTCAATTGCA CAATGGGGCA TATTCAATTG ATGAAAAACT CTGTAATAAC
TGTCAAGGTT ACTATGCAGA GCCCCAATGT ATTATCCAAT GCCCTATTAG TAGTCCGATT
CCTACCCACG CCAAGAAAGG CAGATACAAA GCAGTAGAAA GAGTTGCAAT TTTTGAAGAT
CTTTTTGTCA ATGGTAATAA TACTCCTTTT GCTTCATCAA TGGTCATTTG GGAAGCGTGT
AACCTATTAA CCAGTGCCAC AATTCTTCCT TGGGAAAAAG ATGTTGATGG AACCTTATAT
TACCAGCGAG AGGTTAAAAA AGGTCGAGGA AGTATCATTT TTCGGTTAAG TAATACACCA
GAATTAGCTA CCAATCAAAC CATTGACTAT GCCTCAGACT TTTCATCAAT AGAATCCCTA
GACATTCGCT CTTCTGTACT ACACTTAATT TATGCTGCCT ATGCGACAAC ATTAGATAAA
CCTTGGGAAC AAGAATTTGT TATCAATGAT CAACAAATAG AACATTATTT AGGCTTAGAT
AAACGCAAAG ATCTCAGTAA AGCCACTAAG CTATCTTTGA TCAAGAATTT AGTTCAGCAA
CCTTGTCAAT TAAGGGCTGC CATTGATTGG CCGCAACAAG GTAAAGTTAA AGGATTTTAT
GTTCCAGAAA GTCCTCTTTG GCATTTAGTT GATATTCAAC ATCATTTTCA GGAAGACTCC
CTAGGATGCA AACATCTGAT TGGATTAACA TTTACTGTAA AACCCGGACT GTGGGCTAAA
TTTTTCTTAA ATAAACAAGA CTATAGTCAT CGGATTGCCT TTTATCAATA TGGTTCCCTT
CCCAAGTTTT TGTTAAATAC CGTGATGAGT ATTTGGCAAC AACATCAAGG GGCAGTTCGC
ATTATGCTAT GGTTGCTTTT TAAAAGTAAA ATGGGTCGCA AACAATGCTT AACGGTTTCT
ACCTTAATGC GAGTGGCTTA TGGCCAAGAA AAAGTTAACC TAGCCAGCTT ACAACGCGAA
CAACGAAAGC GACTTATTCG GTCTTTTGAA AGTGATCTCG AAGTGTTAAA TCATTATGGA
CTGAAAGCCG TTTTTGATCC AATTTCCTAT CCTGAAACGA TTCAACCAAT GTGGGTTAAA
TTAGCACAAC TCCCTGATGA TGCTGATGAA GCCGTGGAAT TTTGGATTCA TGATGGTTCT
CAAGAACATC GCTTAACAGA TTCTGGTCCT AGGGGGAAAT GGAATCAGTT AATTAAAGCG
CGGATTCTTA CTTTTGAATT ACCCCCAGAA TGGGAAGAAC AATTAGCAAA ATTCGAGCGT
AAAAAACAAC AAATTACTAA TCGAAAAACC CGTTCTAAGA AAGTTGGTGA ACTAACTTCT
GATCAGATTT TAGCAGCGAG GCAACGTCAG GGAATGAGTC AGAGAGCACT AGCCGAAAAA
TTAGGAAAAA GTCAAAGTTG GATTCGAGAT TTAGAACGAG GACGCTTCTC CGCAAAACCT
GAAGATCGAG CTATTCTGCA AACGGTTTTA GGATTACAAT CTTGA
 
Protein sequence
MSYTITDSCP NCTSCQIDCP TDAIQLHNGA YSIDEKLCNN CQGYYAEPQC IIQCPISSPI 
PTHAKKGRYK AVERVAIFED LFVNGNNTPF ASSMVIWEAC NLLTSATILP WEKDVDGTLY
YQREVKKGRG SIIFRLSNTP ELATNQTIDY ASDFSSIESL DIRSSVLHLI YAAYATTLDK
PWEQEFVIND QQIEHYLGLD KRKDLSKATK LSLIKNLVQQ PCQLRAAIDW PQQGKVKGFY
VPESPLWHLV DIQHHFQEDS LGCKHLIGLT FTVKPGLWAK FFLNKQDYSH RIAFYQYGSL
PKFLLNTVMS IWQQHQGAVR IMLWLLFKSK MGRKQCLTVS TLMRVAYGQE KVNLASLQRE
QRKRLIRSFE SDLEVLNHYG LKAVFDPISY PETIQPMWVK LAQLPDDADE AVEFWIHDGS
QEHRLTDSGP RGKWNQLIKA RILTFELPPE WEEQLAKFER KKQQITNRKT RSKKVGELTS
DQILAARQRQ GMSQRALAEK LGKSQSWIRD LERGRFSAKP EDRAILQTVL GLQS