Gene A9601_08721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_08721 
Symbol 
ID4717577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp754389 
End bp755711 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content28% 
IMG OID640078584 
ProductRieske iron-sulfur protein 2Fe-2S subunit 
Protein accessionYP_001009263 
Protein GI123968405 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACA GACAAATTAA TTTTTTTAAA TCAAAAGACT TTAATACCGT TCTTAAGCCA 
TTTAAAAAAG GAACGGTAGT AAAAATTGAC TCGTTAGATG TTAAGGAAAG CCCAAATGAA
TTAAAAATAG GTTTATTTGG TTGGTATGCA ATTTGTCCTT CGAAAGAACT AAAAAATAAT
AAGCTATATT ATTTTTCACT TTTTGATGAG CCTCTTGTTC TTTATAGAGA TGAAAATGAA
AACGTTAGGT GCATTAAAAA TATTTGTCCA CATAGGGGGG CCTCTTTTTT TGAGGGGTCA
TTATCAGATG GAGTAATAAC TTGTCCATAT CATGGAGCTA AATTCTCATC TGGTGGAAGT
TGCCAAAATC TCGACAGAAT AACATGTAAA CATATAGTAG ATAATAACTA CGATAACTAC
GCCAAAAGAA TACACTTATC TCAATACAAA GCTTTAGAAA AAAATGGATA TATTTTTGTA
CATTTTTCTA AAAAATCTGA CACTGATTTA AGTAACATAA GTGAAGATGT GCCAATAAGT
AATTACGAAT TATATGAAAA TGGGTTTTTG CATAAAGATT ATGTATTTGA AGAGGTATTA
GTTGACTTTA AATGTGATTG GTCAAGGATA ATTGAAAACC ACTTAGATAT CCTCCATCTT
TTTTGGGTTC ATGGAGATAC AATCCCTGAT AAAGATGTTA ATAAAAATGT TCTAGTTAGT
TTTAACCAGA AAATTAATGT GACTCATAAA TACATTGAAA GTATTTATTA CTACAAGAAT
GAACCTACAA AAGAATTTAT CCGGATAAAG TACATACCAC CAGGAAGGAT ATTAATTTAT
AAAGGTGATC CTTCTGCAGC TAGATATTTA CAAGTTTTAG ATCATATTCC ACTAGGAAAT
AACAAAGCAA GAGTAATAGT GAGACATTAC AGGAAATTTT TAAGAAATAA ATTACTTAAT
AACCTCATGT TGTTTAAAGA GAACCAAAGA AAGATTTTTT ATAAGATATT CGATGAGGAT
TATATGATTT TAAAAACACA AACATATAAC CACAATATGG GATTTATAAG TAAAGATGAA
ATAAAATTAT TGGGAGAAGA TAGAATAATC AATTATTTTT GGAAATGGTA CAAAAAGTCT
GAAGATAAAG ATGAACCATG GAAAAATAAT ATAAAAAATC AAAATCTTGA TGTCTATGAC
AAAGTGATAT TGAAATATCC TCCTGAGATA AAGAAGTTAG AAATTGTTAA TAATATAGAA
ATTATTAGAA AAACATTAGT ACGATTTGCT GCTCCGCTTA TATTTTTTAT CTTAATAATA
TAA
 
Protein sequence
MENRQINFFK SKDFNTVLKP FKKGTVVKID SLDVKESPNE LKIGLFGWYA ICPSKELKNN 
KLYYFSLFDE PLVLYRDENE NVRCIKNICP HRGASFFEGS LSDGVITCPY HGAKFSSGGS
CQNLDRITCK HIVDNNYDNY AKRIHLSQYK ALEKNGYIFV HFSKKSDTDL SNISEDVPIS
NYELYENGFL HKDYVFEEVL VDFKCDWSRI IENHLDILHL FWVHGDTIPD KDVNKNVLVS
FNQKINVTHK YIESIYYYKN EPTKEFIRIK YIPPGRILIY KGDPSAARYL QVLDHIPLGN
NKARVIVRHY RKFLRNKLLN NLMLFKENQR KIFYKIFDED YMILKTQTYN HNMGFISKDE
IKLLGEDRII NYFWKWYKKS EDKDEPWKNN IKNQNLDVYD KVILKYPPEI KKLEIVNNIE
IIRKTLVRFA APLIFFILII