Gene Cyan8802_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0843 
Symbol 
ID8390151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp849313 
End bp850563 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content48% 
IMG OID644978862 
Productcompetence/damage-inducible protein CinA 
Protein accessionYP_003136616 
Protein GI257058728 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00199] competence/damage-inducible protein CinA C-terminal domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCAG AAATTATTTG TGTGGGAACG GAGCTATTAT TGGGGGATAT TCTTAATAGT 
AATTCTCAAT TTTTAGCGAA AGAATTAGCT CGGTTAGGCA TTCCCCATTA TTATCAAACG
GTTGTCGGCG ATAATCCCAG TCGGTTAAAA CAAGTGATTG AAATTGCCAG TAATCGGGCT
TCTATTTTAA TTTTCACTGG GGGATTAGGA CCAACTCCTG ATGATTTAAC CACAGAAACG
ATCGCTGATT TTTTCAACAC TCCTCTAGTG GAACGTCCCG AAATTATTGA AGATATGAGT
CGGAAATTCG CAGCACGAGG GCGAACCATG ACGGATAATA ACCGTAAACA GGCGTTACTC
CCCCAAGGGG CTGATATTTT AGCTAATCCC TTGGGAACGG CTCCGGGTTT GTTGTGGCAA
CCTCGCCCGA ATTTAACCCT AATGACTTTC CCTGGAGTCC CCTCGGAAAT GAAGCGGATG
TGGCAAGAAA CGGCGATTCC TTACCTGAAA AACCAAGGTT GGGGCAAAGA AATCATTTTT
AGCCGTATGC TGCGTTTTAG GGGCATTGGT GAGTCAGCCT TGGCCGCGAA GGTATCCCGA
TTTTTTGACT TAACTAATCC TACGGTCGCT CCCTATGCGT CGTTAGGAGA AGTGCGTCTG
CGGGTGTCGG CTAAAACAAG GTCAGAACAG GAGGCCATCG CCCTAATTGA CCCCGTTGCC
CAAGAATTGC AGAAAATCGC CGGATTGGAC TATTATGGCT CTGATGATGA GACGTTAGCT
TCTGTTGTCG GGAGTGTATT GCGGCAAAAG GGAGAGACGG TGAGTGTCGC GGAGTCCTGT
ACAGCAGGGG GGTTAGGTTC GGTTCTAACG TCTGTTGCAG GGAGTTCGGA CTATTTTCGG
GGGGGGATTA TTGCCTATGA TAATTCGGTG AAAGTGGATT TATTAGGGGT TAATCCGGCA
GATTTAGAGC AATATGGAGC GGTTAGTGAT ATTGTTGCAC AACAAATGGC TCTGGGGGTT
AAACAACGCT TAGGGACTGA CTGGGGAGTG AGTATAACCG GAGTTGCGGG GCCTGGTGGG
GGGACGGACA CAAAACCTGT GGGTTTGGTG TATGTTGGGT TAGCGGATAG TCAGGGACAG
GTGGAGAGTT TTGAATGTCG GTTTGGGACA GAACGCGATC GGGAAATGGT GCGATCGCTA
AGTGCTTATA CTGCGTTGGA TCACTTACGT CGGAAATTGT TGGTTAGATA G
 
Protein sequence
MSAEIICVGT ELLLGDILNS NSQFLAKELA RLGIPHYYQT VVGDNPSRLK QVIEIASNRA 
SILIFTGGLG PTPDDLTTET IADFFNTPLV ERPEIIEDMS RKFAARGRTM TDNNRKQALL
PQGADILANP LGTAPGLLWQ PRPNLTLMTF PGVPSEMKRM WQETAIPYLK NQGWGKEIIF
SRMLRFRGIG ESALAAKVSR FFDLTNPTVA PYASLGEVRL RVSAKTRSEQ EAIALIDPVA
QELQKIAGLD YYGSDDETLA SVVGSVLRQK GETVSVAESC TAGGLGSVLT SVAGSSDYFR
GGIIAYDNSV KVDLLGVNPA DLEQYGAVSD IVAQQMALGV KQRLGTDWGV SITGVAGPGG
GTDTKPVGLV YVGLADSQGQ VESFECRFGT ERDREMVRSL SAYTALDHLR RKLLVR