Gene P9211_03631 details

Gene Information

Locus tag: P9211_03631
Symbol:
ID: 5731703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 340145
End bp: 341599
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 42%
IMG OID: 641284712
Product: putative neutral invertase-like protein
Protein accession: YP_001550248
Protein GI: 159902904
COG category:
COG ID:
TIGRFAM ID:
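
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon. Both are assumptions about the record's conventions, not something the page states, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 340145
end_bp = 341599
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: start and end are 1-based, inclusive coordinates.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon
# that is counted in the gene length but not in the protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)            # True
print(span_bp % 3 == 0)                     # True
print(coding_codons == protein_length_aa)   # True
```

Under these assumptions the span is 1455 bp, which is 485 codons, or 484 amino acids plus one stop codon.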


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.072592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGTC GTTTTAGTCA ACAGCACAAA AGGTTGAGAC CCAACTCTAA TGAGGATGCG 
GTTATCAAAA GAGCTCAAGA GCATTTCGAG AGATCACTAG TTGAAATTTC TGGGAGCATT
TCAGGCAGTG TTGCAGCTCT TGAGCATCCA GCAAACAATG ATGCATTGAA TTATGGAGAG
ATCTTCCTTA GGGACAACGT GCCCGTAATG ATTTATCTAT TGACGCAAAA TAGATATGAC
ATAGTCAAAA AATTCTTAAC TGTATGCCTT GATCTTCAGA GTACTACTTA TCAAACAAGA
GGTATTTTCC CTACTAGCTT TGTTGAAGAG AACGGAGAAC TTATTGCAGA CTATGGGCAA
AGATCGATAG GAAGAATTAC ATCTGCAGAT GCGAGTTTAT GGTGGCCAAT CTTGTGCTGG
TTATATGTCA GGAAAAGCAA AGATACCAAT TTTGGAGTCA GTCAGCAAGT GCAGCGTGGA
GTGCAACTGC TTTTAGACCT AGTTTTACAT CCCACTTTTG AAGGAACTCC TGTCCTTTTT
GTACCTGACT GTTCATTCAT GATTGACCGT CCCATGGATG TATGGGGTGC TCCCTTAGAA
GTTGAAGTTC TCTTATATGC ATGCTTAAGC AGTTGTATAG AACTTATGGA TCTAAGCAGC
AAGCACCAAG TAAGCCGCCT GCTAGACCAA AGGCTTCTTC TTACAAGGCA ATGGGTACAT
GATCTGAGGC AATTTCTTCT TAAGCACTAT TGGGTCACCA GTAAAACAAT GCAAGTACTA
AGGAGAAGGC CTACAGAACA ATATGGAGAA GACCAGCATC AAAATGAATT TAATGTTCAA
CCTCAAGTTG TCCCCTCTTG GCTGCAAGAT TGGCTAGAGA ATCGTGGTGG ATATCTGATT
GGGAATATTC GAACTGGACG ACCTGATTTT CGTTTTTATA GTCTTGGAAA TTCGCTTGCA
TGCATGTTCG GAGTATTAAC TGCTCCACAA CAGAGAGCAC TCTTCAGATT GGTTCTTCAC
AACCGACAAC ATCTAATGGC ACAAATGCCT ATGCGAATAT GCCATCCCCC AATGGAAGTA
GAAGAATGGC AAAACAAAAC TGGATCTGAT CCTAAGAATT GGCCATGGAG CTATCACAAT
GGAGGGCATT GGCCCAGCAT TCTTTGGTTC TTTGGAGCAT CAATACTTAT GCACGAGAAA
AGATATCCCA AAGCAGATGT TCTGTTGATG GGCCAAATGC GCACTCTGTT GGAAGAATGT
TACTGGAGTC AATTAAATCA ACTTCCAAAA CAAAAATGGG CCGAGTATTT TGATGGTCCT
ACAGGAACCT GGGTAGGTCA ACAATCAAGG ACATACCAAA CTTGGACAAT TGTTGGCTTT
CTTCTTTTAC ATCACTTTCT CAAAGTATGT CCTGATGATA TATCAATGCT CGATCTAGAC
TTAGAAAAGA CCTAG
 
Protein sequence
MAGRFSQQHK RLRPNSNEDA VIKRAQEHFE RSLVEISGSI SGSVAALEHP ANNDALNYGE 
IFLRDNVPVM IYLLTQNRYD IVKKFLTVCL DLQSTTYQTR GIFPTSFVEE NGELIADYGQ
RSIGRITSAD ASLWWPILCW LYVRKSKDTN FGVSQQVQRG VQLLLDLVLH PTFEGTPVLF
VPDCSFMIDR PMDVWGAPLE VEVLLYACLS SCIELMDLSS KHQVSRLLDQ RLLLTRQWVH
DLRQFLLKHY WVTSKTMQVL RRRPTEQYGE DQHQNEFNVQ PQVVPSWLQD WLENRGGYLI
GNIRTGRPDF RFYSLGNSLA CMFGVLTAPQ QRALFRLVLH NRQHLMAQMP MRICHPPMEV
EEWQNKTGSD PKNWPWSYHN GGHWPSILWF FGASILMHEK RYPKADVLLM GQMRTLLEEC
YWSQLNQLPK QKWAEYFDGP TGTWVGQQSR TYQTWTIVGF LLLHHFLKVC PDDISMLDLD
LEKT
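
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that translation and of a GC-content calculation; it is not the database's own pipeline. The function names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first twenty bases and the last line of the gene sequence are used as sample input. For amino-acid assignments, table 11 matches the standard genetic code; it differs in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Strip the spaces and line breaks that group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Sample input copied from the start and end of the gene sequence above.
first_bases = "ATGGCAGGTC GTTTTAGTCA"
last_line = "TTAGAAAAGA CCTAG"

print(translate(first_bases))  # MAGRFS
print(translate(last_line))    # LEKT
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the protein sequence and the GC content field listed in the record.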