Gene P9303_12771 details

Gene Information

Locus tag: P9303_12771
Symbol:
ID: 4777319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1097629
End bp: 1098789
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 53%
IMG OID: 640086785
Product: hypothetical protein
Protein accession: YP_001017289
Protein GI: 124022982
COG category:
COG ID:
TIGRFAM ID:
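
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch like the following. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1097629, 1098789)
print(length)                  # 1161
print(protein_length(length))  # 386
```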


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0568502
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAAGC TCAGCAACAC CGAACCCTGG CTAAAAACCT TCCGCAATCT CGTCCGCAAG 
ACGACTGCTG AGGATTGGTG GATTGTGAAA TCTGGCAATC GCATACGGCT TCAGGTTCCT
GGTGTTGGCA GCAAAGTGTT GCCCTACGAC TGGTCGGAAG AAGGTGCAGC CCATGCCTTG
CCCCGCATTC AGCAGATCTT TAAGCGTTGG GCTGATGGAA ACATCACACT TGCAGAGGCA
GCGCAAACTG CTGATACCAG CAGTTCCAAG CAGCAACTGA ACTTCGACGA GCTGATCGAG
AGCTATAGAA AGTTCGTCCC CAACGCTGGC GATAAAACCT GGAAGAAGAA TTATCTGCCC
GTACTGCGCA ATTGCCGAGA CAAGTTCAAA GGAACCCCAC CACGTGATGG CGAAATACTC
TGCATGGGAT GCCTTGAGCA ATGGGAGCAA GGCTCCAGGT CCCGTCAGAT CAGCCGTCAA
AAGCTTTATG GCTTTCTGAC TTGGGCGGTG CAACGTGGTC ACCTAAAAAC GATCTACTTG
CCTCCTACGT CCCTGCCAGA AGTGCGTAAG GCCAAGCGCG TTGGCTATGC CATTTCAGAT
GTAGAGATCC TCAGGTTGTT GGAAGGAATG CCTGATCCAC GTTGGCAATT TGCTGTTCAG
CTCTGCAGCG TCTATGGCCT CAGGCCAGAA GAATTGCGAT GGCTACGGAT CAAAAACGGG
GCAAAGGGTT CTGAACTGTG GACCATCTAT CAAAAGTCGA TGGGCGGCAG GAAAGGAGAT
AAGACAGAAC CGCGTCGCTT GCTACCGCTA CTCGTTCGTG ATCTTGACGG CTCTTCCATT
GATTGGAAGT TGCAAGCCCG GCTTCAAGTT GGCGAAAAGT TACCCCCACT GCAGAGCGAT
GGCGATGGAG CACAGGCATT AAGGAACTAC CTACGTCGGC GTGAGGTTTG GAGGAGCTTG
AAAACTGAGG CGTTGAACAC AGGTGAACAG CTCACGACGT ATTCCTTTAG GCATCGCTAT
GCCAAGGCTT CACATGCAGC CGGTTTGCCT GTGGCCAATA TCGCTGAGGC CATGGGGCAC
ACGATTGAAG TGCATCTCGG TAGTTACGCC AGGTTCAAAC CAGATGCAAC AGCAGACCTT
TATGCGCAGG TGAACGCTTA A
 
Protein sequence
MGKLSNTEPW LKTFRNLVRK TTAEDWWIVK SGNRIRLQVP GVGSKVLPYD WSEEGAAHAL 
PRIQQIFKRW ADGNITLAEA AQTADTSSSK QQLNFDELIE SYRKFVPNAG DKTWKKNYLP
VLRNCRDKFK GTPPRDGEIL CMGCLEQWEQ GSRSRQISRQ KLYGFLTWAV QRGHLKTIYL
PPTSLPEVRK AKRVGYAISD VEILRLLEGM PDPRWQFAVQ LCSVYGLRPE ELRWLRIKNG
AKGSELWTIY QKSMGGRKGD KTEPRRLLPL LVRDLDGSSI DWKLQARLQV GEKLPPLQSD
GDGAQALRNY LRRREVWRSL KTEALNTGEQ LTTYSFRHRY AKASHAAGLP VANIAEAMGH
TIEVHLGSYA RFKPDATADL YAQVNA
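
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It is not the tool the database used: it builds the codon table by hand, relying on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The helper names are hypothetical, and only the first 30 and last 12 nucleotides of the gene are used, so the GC figure it prints describes that fragment and not the whole gene.

```python
# Amino-acid assignments in TCAG codon order (standard code; table 11
# uses the same assignments for elongation and stop codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGGGCAAGC TCAGCAACAC CGAACCCTGG"  # first 30 nt of the gene sequence
tail = "GTGAACGCTTAA"                      # last 12 nt, including the stop codon

print(translate(head))  # MGKLSNTEPW
print(translate(tail))  # VNA
print(gc_percent(head))
```

Pasting the full gene sequence into `translate` and `gc_percent` in place of the fragments gives a way to compare against the listed protein sequence and GC content.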