Gene Cyan8802_3626 details


Gene Information

Locus tag: Cyan8802_3626
Symbol:
ID: 8392968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3700030
End bp: 3701118
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 35%
IMG OID: 644981555
Product: Rieske (2Fe-2S) domain protein
Protein accession: YP_003139277
Protein GI: 257061389
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
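
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3700030
end_bp = 3701118
reported_gene_length = 1089      # bp
reported_protein_length = 362    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the terminal stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "1089 362"
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.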


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.54925
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTACAAA CTAATGGAAA AGTTACCCAT TTATCAGATA ATATTTATTC TTTAAAAGTT 
CCTACTGATT TACGCAAAGT TGGTTTAAAT CCTAATTTTT GGTATCCTTT AGCACAAGCA
AAAGATGTTA AAATTGAGAA ACCTTATGCA GTAAGTTTTG CGGGTAATCC TATCGTTTTA
ATTCGGACAA AATCTAATCA ACTTTTTGCC CTTGAAGATC GTTGTGCTCA TCGACAAGTT
CCTTTAAGCA TGGGGATAGT TTGTGGTAAT ACGATTCAAT GTACTTACCA TGCTTGGCAA
TACAATCAAA CAGGGAAAGT CGCTAAAGTT CCCTATTTAC CGCAGGGATG TCCTTTACCC
AAGGGGGTTA AAAGTTATCC TTGTCGAGAA GCTTATGGTC ATATTTTTGT CTTTCCTGGG
ACTGTAGAAT TAGCCGAAAA TGTACCATTT CCTGAGATAA AAAATTGGTC TGATTCCAAT
TATAAAACAA TGTATTTTTC TCGTCAAGTA AATTGTCATT ATTCATTCCT AAAAGAGAAT
TTAATGGATA TGAATCATCA ATTTTTACAT CGGAGATTTA TGGGAAAAGT CAAACCTACT
TTACTAGAAA ATTCAAAAGG AGATAATTGG GTAGAAGCTA AATTTAAGTT TAGTGGAGAC
CCCCATCCTA GCGCAAATTT GGTCTTAGCT GCGGGACGCA AACAAGAATC AGAATGTTCT
AATTCTGATT TTGATATTAT GACTATTCGG ACAGAATATC CCTATCAAAC TTTACGGGTT
TGTCCTCCAG GTTCTGATTT TCCCCTTCTT GATCTTTGGG TTGCTTATAT TCCCATTGAT
CGAAAACAAA AAAAGAATCA TAGTTTTGGA ATGCTGATGA TTCGTCAACC TAAAATCCCT
GGATTAATTC ATCTTTTATG GCCATTAATG CGTTATTTTA CTGGGGTTAT TTTTGCGGAA
GATAAAATGA TTGTTGAAGC GGAACAAACG GCTTATAATC TTCAAGGAGG GGACTGGAAT
CAAGAAGTCT TTCCTGTTCT TCTTGATGTT CGAGAATTAT TAACTAAAAA AGGAGTTCCT
ATCAGTTAA
 
Protein sequence
MVQTNGKVTH LSDNIYSLKV PTDLRKVGLN PNFWYPLAQA KDVKIEKPYA VSFAGNPIVL 
IRTKSNQLFA LEDRCAHRQV PLSMGIVCGN TIQCTYHAWQ YNQTGKVAKV PYLPQGCPLP
KGVKSYPCRE AYGHIFVFPG TVELAENVPF PEIKNWSDSN YKTMYFSRQV NCHYSFLKEN
LMDMNHQFLH RRFMGKVKPT LLENSKGDNW VEAKFKFSGD PHPSANLVLA AGRKQESECS
NSDFDIMTIR TEYPYQTLRV CPPGSDFPLL DLWVAYIPID RKQKKNHSFG MLMIRQPKIP
GLIHLLWPLM RYFTGVIFAE DKMIVEAEQT AYNLQGGDWN QEVFPVLLDV RELLTKKGVP
IS
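
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how such a block might be cleaned and its GC content computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGTACAAA CTAATGGAAA AGTTACCCAT TTATCAGATA ATATTTATTC TTTAAAAGTT "
last_line = "ATCAGTTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))                # number of bases on the first line
print(head.startswith("ATG"))   # whether the CDS opens with an ATG start codon
print(tail.endswith("TAA"))     # whether the CDS closes with a TAA stop codon
print(round(gc_percent(head + tail), 1))  # GC% of the sample lines only
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` would give a figure that can be compared with the GC content field of the record.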