Gene PCC8801_1953 details


Gene Information

Locus tag: PCC8801_1953
Symbol:
ID: 7105173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2030889
End bp: 2032232
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 38%
IMG OID: 643475015
Product: protein of unknown function DUF224 cysteine-rich region domain protein
Protein accession: YP_002372147
Protein GI: 218246776
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
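
The coordinates and lengths listed above are consistent with each other, which can be checked with a short calculation. The sketch below is illustrative only; it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 2030889
end_bp = 2032232

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1344 447
```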


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACAG AACAAATAAA TTTAACACCA AAAATAGAGC CTAATATTAG CGGATTTGAC 
GATAAAAATC CTCCTCCTAA AGAACTAATA GATGCTTGTG TACACTGTGG ATTTTGTCTA
TCAACTTGTC CGAGTTATCG AGTTATTGGA AAAGAAATGG ACTCACCACG GGGAAGAATT
TACCTGATGA ATGCCATTGA ACAAGGGGAA GCAGCAATTG ATGAAACAAC CTCCCAACAT
TTTGATAGTT GCTTGGGGTG TTTAGCGTGT GTTAGCACTT GTCCGTCTGG GGTACAGTAT
GATCAATTAA TCGCAAAAAC TCGGCCACAA GTTGAACGAA ATCAACCGAG AACGTTAAAA
GACAAAATTA TTAGATCTCT AATTTTTAAT ATCTTTCCCT ATCCTCAACG GTTACAGATT
TTTTTACCAT TATTGTGGCT ATATCAAAAA TTAGGGATTC AAAAATTAAT CCGTGCGACA
GGGTTATTTA AAAAATTCTT TCCCCGTTTA GCAGCGATGG ATTCTATTTT GCCAGAAATT
ACCCTAAAGA AATCTCAAAC CTTCCCTGAT ATTATTCCGT GTCAAGGGAC AAAACGCTAT
CGCGTGGGGA TGATTTTAGG CTGTGTCCAA CGGTTATTTT TTTCCCCTGT CAATGAAGCA
ACTGCCAGGG TTTTGACGGT AAATGGGTGC GAAGTTGTCA TCCCGAAAAC CCAAGGATGT
TGCGCTGCTT TACCGGAACA CCAGGGACAA GAAGAACAAG CACAAACCTT AGCAAAACAA
ATGATTGATA GTTTTGCAGA AACTGACGTT GATTATATTA TTATTAATGC CGCCGGGTGC
GGCCATACCT TAAAAGAATA TGGTCATATT TTAGCCGATG ATCCCGACTA TAGAGAGAAG
GCTAAACAGT TTTCTGAAAA AGTAAAAGAT GTTCAAGAAT TTTTAGCAGA AGTGGGGTTA
ACTGCACCGT TAAACCCCTT AACTGAGGGA AAATTGATGA TGGTGTATCA GGATGCTTGC
CATCTATTAC ACGGTCAAAA AATTAGCTTA CAACCGCGAC AATTATTACA AAAAATTCCA
GGGATAATCT TAAAAGAACC CGTTGATGCT GCCTTATGTT GTGGTAGTGC AGGGGTTTAT
AATATGTTGC AACCCGAAGT TGCTGAAGAA TTAGGACAGC AAAAAGTCAA TAATTTATTA
AATACGGGAG CAACCTTAAT TGCCTCTGCT AACCCTGGAT GTTCTTTGCA GATTAAAAAA
CATTTACGGT TACAGGGGAA AGAAATAACA TTAATGCACC CGATAGAGTT GTTAGATTAT
TCCATTAGAG GAATACAATT ATAA
 
Protein sequence
MQTEQINLTP KIEPNISGFD DKNPPPKELI DACVHCGFCL STCPSYRVIG KEMDSPRGRI 
YLMNAIEQGE AAIDETTSQH FDSCLGCLAC VSTCPSGVQY DQLIAKTRPQ VERNQPRTLK
DKIIRSLIFN IFPYPQRLQI FLPLLWLYQK LGIQKLIRAT GLFKKFFPRL AAMDSILPEI
TLKKSQTFPD IIPCQGTKRY RVGMILGCVQ RLFFSPVNEA TARVLTVNGC EVVIPKTQGC
CAALPEHQGQ EEQAQTLAKQ MIDSFAETDV DYIIINAAGC GHTLKEYGHI LADDPDYREK
AKQFSEKVKD VQEFLAEVGL TAPLNPLTEG KLMMVYQDAC HLLHGQKISL QPRQLLQKIP
GIILKEPVDA ALCCGSAGVY NMLQPEVAEE LGQQKVNNLL NTGATLIASA NPGCSLQIKK
HLRLQGKEIT LMHPIELLDY SIRGIQL
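
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It joins the space-separated 10-letter blocks, computes a GC fraction, and translates codons until the first stop. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which are the same in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks used for display."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = join_blocks(
    "ATGCAAACAG AACAAATAAA TTTAACACCA AAAATAGAGC CTAATATTAG CGGATTTGAC"
)
last_line = join_blocks("TCCATTAGAG GAATACAATT ATAA")

print(translate(first_line))  # prints: MQTEQINLTPKIEPNISGFD
print(translate(last_line))   # prints: SIRGIQL
```

The two printed strings match the start and end of the protein sequence listed above. Applying `gc_fraction` to the full joined gene sequence should give a value comparable to the GC content field, though that result is not asserted here.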