Gene PCC8801_2847 details


Gene Information

Locus tag: PCC8801_2847
Symbol:
ID: 7104372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2937498
End bp: 2938514
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 41%
IMG OID: 643475883
Product: hopanoid biosynthesis associated radical SAM protein HpnH
Protein accession: YP_002373002
Protein GI: 218247631
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID: [TIGR03470] hopanoid biosynthesis associated radical SAM protein HpnH
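
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values listed, but the record itself does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2937498
end_bp = 2938514

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1017
print(protein_length_aa)  # 338
```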


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGTTC AATTGCGACA AGCACTCAAG GTTGGCACTT ATATTATTAG TCAGCGTTTA 
TCGGGTCGTC AACGCTACCC CCTGGTATTG ATGCTAGAAC CCCTATTTCG CTGTAATTTA
GCCTGTTCAG GTTGCGGAAA AATTCAGCAT CCTCCCGAAA TTTTAACCCG TAATTTGACC
CCCGAAGAAT GTTTTACAGC AGTAGAAGAA TGTGGTGCTC CGGTGGTGTC TATTCCTGGG
GGAGAACCCT TGTTACATCC CCAAATTGAT GAAATTGTTA AGGGGTTAGT CCAACGGAAA
AAGTTTGTCT ATCTATGTAC TAATGCAATT TTACTAGAAA AAAGCCTCGA TAAATTTGAA
CCGTCTCCCT ATCTAACCTT TAGTGTTCAC CTCGATGGGT TACGGGAACA TCATGATAAA
TGTGTTGATC GTCAAGGGGT ATTTGATAAA GCGATTCAGG GTATTCGTGC TGCTAAAGAA
AAGGGATTTC GTGTAACAAC AAATACGACC ATTTTTGAAG GAACCGATCC TCAAGAAATG
CAGGAATTTT TTGACTTTCT GGAAACCTTG GGAACTGATG GTATGATGAT TTCTCCAGGG
TATAGTTACG AATGGGCTCC CGATCAAGAA CACTTTCTTA AACGGGAACA AACCAAGGCA
TTATTTCAAC AAATTTTGAT GCCTTGGAAG ACAGGGAAAA AGCGTTGGAA TTTTAATCAC
AATCCCCTAT TTTTAGATTT TCTGTTAGGA GAAAAAGACT ACGAATGTAC TCCTTGGGGA
AGTCCGAGTT ATAGTGTTTT GGGATGGCAA AAACCCTGTT ATTTGCTCAA TGAAGGACAC
TATAAAACCT TCAAAGAACT GTTAGAAGAA ACCAACTGGG AAAACTATGG ACGCAAGAGT
GGTAATCCTA AATGTGCTGA CTGTATGGTA CATTGCGGAT ATGAACCCAC GGCTGCCGTT
GATGCCATGA ATCCTGCTAA CATGGGACGA GCATTAGAAA GTTTGTTTAG TGCGTAA
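
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a gene sequence laid out in blocks of ten bases like the one above. The following is a minimal sketch, not the method the database itself uses: the function name gc_percent is hypothetical, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence in place of this first row to get the
# figure for the whole gene.
first_row = "ATGGCTGTTC AATTGCGACA AGCACTCAAG GTTGGCACTT ATATTATTAG TCAGCGTTTA"
print(round(gc_percent(first_row), 1))
```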
 
Protein sequence
MAVQLRQALK VGTYIISQRL SGRQRYPLVL MLEPLFRCNL ACSGCGKIQH PPEILTRNLT 
PEECFTAVEE CGAPVVSIPG GEPLLHPQID EIVKGLVQRK KFVYLCTNAI LLEKSLDKFE
PSPYLTFSVH LDGLREHHDK CVDRQGVFDK AIQGIRAAKE KGFRVTTNTT IFEGTDPQEM
QEFFDFLETL GTDGMMISPG YSYEWAPDQE HFLKREQTKA LFQQILMPWK TGKKRWNFNH
NPLFLDFLLG EKDYECTPWG SPSYSVLGWQ KPCYLLNEGH YKTFKELLEE TNWENYGRKS
GNPKCADCMV HCGYEPTAAV DAMNPANMGR ALESLFSA
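
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard table and differs in which start codons are permitted. Since this gene begins with ATG, alternative start codons are not handled here. The helper names (clean, translate) are illustrative, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGCTGTTC AATTGCGACA AGCACTCAAG"))  # MAVQLRQALK
```

Passing the complete gene sequence to translate and comparing the result with clean applied to the protein sequence is a quick way to confirm that the two listings agree.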