Gene P9211_11131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_11131 
SymbolpetH 
ID5730423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1015256 
End bp1016341 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content41% 
IMG OID641285481 
Productferredoxin-NADP oxidoreductase (FNR) 
Protein accessionYP_001550998 
Protein GI159903654 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.93922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTACT CGGAAGCAAA TGTAATCGCC GGTGGCCTGG CGCATATACC TGTTCTCATT 
GGGGTATTTG GCTTTATTCA GTCATTTATT CTCAAGCGAA CCCAAGCAAA AGGCACATCC
AATCAGCCCA GTACTCAGAC GAAGCCAGCC TCATCAGTGG CTTCTTCACA GCCAAAAGTA
ATTAAAAAGC CAGCCCATCC AAATGTTCCC GTTAATACCT ATAAACCAAA GACCCCTTTT
ATTGGGACTG TTAAAGAGAA CTACTCACTA TTGAAATCAG GTGCAATTGG TAGGGTTAAT
CACATAACCT TTGATCTATC TAGTGGAGAC CCTCTTCTTA AATACGTAGA AGGTCAAAGC
ATTGGAATAA TTCCTGCTGG CGAAGATGCT AATGGTAAAC CTCACAAAAT TCGGCTCTAT
TCAATAGCCA GTACAAGACA TGGTGATGAC TATAAAGGTA ATACAGTTTC TCTATGTGTC
CGTCAACTTC AATATGAAAA AGATGGCAAA ACTATTGATG GAGTCTGTTC AACTTATCTG
TGTGACATAA AGCCTGGAGA CAAGGTAAAA ATCACCGGAC CTGTTGGGAA AGAAATGCTT
CTTCCTGAAG ACGAGAATGC CAACATAATT ATGCTCGCCA CAGGTACAGG CATAGCCCCT
ATGAGAGCCT ATCTTCGAAG AATGTTTGAT CCAACAGAAC AAGAAAAAAA CAGCTGGAAC
TACAAAGGGA ATGCATGGCT GTTCATGGGT GCTCCAAAAA CTGCAAACCT TCTTTATGAC
TCTGATTTTG AAGGCTACAA GTCTAAATTC CCTAACAACC TACGTTATAC AAAAGCAATT
AGCAGGGAAC AAAAGAATGC CAGAGGTGGT CGCATGTACA TTCAAGATCG GGTACTTGAA
CACGCTGATG AGATATTTGC ATTGATTGAG AATCCAAAAA CTCATATTTA TCTTTGTGGT
TTAAAAGGAA TGGAACCTGG CATAGATGAA GCAATGACTC AAGCAGCAGC TTCAAAAGGC
TTGGTTTGGT CAGAATTAAG GCCTCAACTT AAGAAAGCAG GCAGATGGCA CGTTGAGACG
TATTAA
 
Protein sequence
MSYSEANVIA GGLAHIPVLI GVFGFIQSFI LKRTQAKGTS NQPSTQTKPA SSVASSQPKV 
IKKPAHPNVP VNTYKPKTPF IGTVKENYSL LKSGAIGRVN HITFDLSSGD PLLKYVEGQS
IGIIPAGEDA NGKPHKIRLY SIASTRHGDD YKGNTVSLCV RQLQYEKDGK TIDGVCSTYL
CDIKPGDKVK ITGPVGKEML LPEDENANII MLATGTGIAP MRAYLRRMFD PTEQEKNSWN
YKGNAWLFMG APKTANLLYD SDFEGYKSKF PNNLRYTKAI SREQKNARGG RMYIQDRVLE
HADEIFALIE NPKTHIYLCG LKGMEPGIDE AMTQAAASKG LVWSELRPQL KKAGRWHVET
Y