Gene PCC8801_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3212 
Symbol 
ID7103940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3355148 
End bp3356110 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content47% 
IMG OID643476234 
ProductNADH ubiquinone oxidoreductase 20 kDa subunit 
Protein accessionYP_002373344 
Protein GI218247973 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAATG TTCTTTGGCT ACAAGGTGGT GCTTGCAGTG GGAATACCAT ATCATTCCTG 
AATGCAGAAG AGCCCACCAT TGTTGATTTG ATTACGGATT TTGGTATTAA TGTTCTCTGG
CATCCATCCC TCGGACTGGA ATTGGGAGAC AGCTTACAGC AACTCCTAAG AGACTGTGTT
AGTGGCAAAA TTGCCGTTGA TATCCTGGTT TTTGAAGGAA GTGTGGTTAA TGCACCCCAT
GGAACCGGAG AATGGAATCG GTTTGCTGGC CGTCCCATGA AAGACTGGTT AGCGGACTTA
TCCAAAATTG CCGGGTTCGT TGTGGCTGTA GGAGACTGTG CCACCTACGG GGGTATTCCA
GCGATGGAAC CTAACCCCAG TGAGTCCATT GGAGTACAAT TCCTTAAACG CAAAGAAGGA
GGCTTTTTAG GGGCAGATTA CCGTTCCCAA GCGGGACTCC CTGTCATTAA TATACCCGGT
TGTCCGGCGC ATCCTGACTG GATTAGTCAA ATTTTAGTCG CGGTAGCTAC GGGACGGGTA
GGAGACATCA CCCTTGATGA GTTTCACCGT CCTGAAACCT TCTTCAAGTC CTTTACCCAG
ACGGGTTGTA CTCGCAATAT GCACTTTAGC TATAAAGCGA CAACTCAGGA CTTTGGACAG
CGTACGGGAT GTCTCTTCTA TGATATGGGC TGTCGTGGTC CGATGACCCA TTCTTCGTGT
AATAGAATCC TCTGGAACCG AGTTTCGTCC AAAACTCGCG CGGGAATGCC CTGTTTAGGC
TGTACTGAAC CGGAATTTCC CTTCCATGAT CTTAAACCAG GAACTGTCTT TAAGACCCAA
ACGGTGATGG GTGTTCCTAA AGAATTACCC CCAGGGGTCA ACAAAAAAGA TTATGCCTTA
TTAACGGTTG TTGCTAAAGA TGCCAGTCCA TCTTGGACAA ACGATGATAT GTTCACCGTC
TAA
 
Protein sequence
MANVLWLQGG ACSGNTISFL NAEEPTIVDL ITDFGINVLW HPSLGLELGD SLQQLLRDCV 
SGKIAVDILV FEGSVVNAPH GTGEWNRFAG RPMKDWLADL SKIAGFVVAV GDCATYGGIP
AMEPNPSESI GVQFLKRKEG GFLGADYRSQ AGLPVINIPG CPAHPDWISQ ILVAVATGRV
GDITLDEFHR PETFFKSFTQ TGCTRNMHFS YKATTQDFGQ RTGCLFYDMG CRGPMTHSSC
NRILWNRVSS KTRAGMPCLG CTEPEFPFHD LKPGTVFKTQ TVMGVPKELP PGVNKKDYAL
LTVVAKDASP SWTNDDMFTV