Gene P9303_21831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21831 
Symbol 
ID4777794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1940907 
End bp1942373 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content55% 
IMG OID640087698 
Productputative neutral invertase-like protein 
Protein accessionYP_001018183 
Protein GI124023876 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.28197 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGAC GCTTAAACCA GCAAAACCAG AGGGTTCGTC CCAACTCCAA TGAAGACCAG 
GTGGTGCAGC AGGTCAAGGA GCATTTCGAA CGCACGCTGA TCGAAGTAGG CGGCACTGTG
GCAGGCAGCG TGGCGGCTCT TGAGCATCAG CCCCATAACA AAGCTCTCAA TTACGGCGAG
GTCTTCCTGC GCGACAACGT GCCCGTGATG ATCTACCTGC TCACCCAAAA GCGTTACAAG
GAGGTGAAGC AATTCCTCAG CGTTTGCCTA GATCTGCAAA GCACCACATA TCAGACACGC
GGGGTGTTCC CCACCAGCTT CGTTGAGGAA CAAGGCGAGC TCATTGCTGA CTATGGCCAG
CGCTCAATCG GCAGGATCAC CTCTGTGGAC GCAAGCCTTT GGTGGCCCAT TTTGTGCTGG
CTTTATGTCA AGCGCAGCGG TGACAAAAAC TTCGGCACCA ACCAAAAGGT CCAACGAGGT
GTGCAGCTGA TGCTCGATCT CGTGCTGCAT CCCACCTTCG AGGGCACTCC AGTTCTATTT
GTTCAGGATT GCTCATTCAT GATTGACCGC CCCATGGATG TTTGGGGGGC ACCACTTGAA
GTGGAAGTAC TGCTCTATGC CTGCCTGCGC AGCTGCATCG AGCTGATGGA ACTGAGTCGC
AAGAATCATG TCAGTCGGCT TCTCGACCAA CGTCTCTTAC TGACCCGCCA ATGGGTGCAT
GACCTACGTC AATTCCTGCT CAAGCACTAC TGGGTCACCA GCAAGACCAT GCAGGTGCTG
CGAAGAAGAC CCACAGAGCA ATACGGCGAC AACCAGCACC AAAACGAGTT CAACGTGCAG
CCACAGGTCG TTCCCGACTG GCTTCAAGAC TGGCTTGAAA ACCGGGGTGG CTATCTCATC
GGCAACATCC GCACCGGACG ACCTGACTTT CGTTTTTACA GTCTCGGAAA CTCCCTAGCC
TGCCTGTTTG GGCTACTGAC CGCCCCACAA CAGCGCGCTC TGTTCCGACT GGTGCTTCAC
AACCGACAGC ATCTGATGGC TCAGATGCCA ATGCGCATCT GTCATCCACC CATGGAAGGG
GCGGAATGGC AGAACAAGAC AGGTTCAGAC CCGAAAAACT GGCCATGGAG CTATCACAAC
GGCGGACACT GGCCCAGCCT GCTCTGGTTC TTCGGAGCCT CAATCTTGTT ACATGAACGG
CGTCATCCTG AGGCCGACGT GCTGCTGATG GGTGAAATGC GCGCCCTACT GGAGGAGTGC
TACTGGAGCC AATTAAACCA ACTGCCGCGT CAAAAATGGG CCGAATACTT TGATGGACCA
ACAGGCACCT GGGTAGGACA GCAGTCAAGG ACCTACCAGA CCTGGACCAT GGTGGGCTTT
CTACTTCTCC ACCACCTTCT ACGCGTCTGC CCCGACGACG TGTTGTGGTT GGATCTCGAC
GAACTGCTGC CAACACATGA GGTGTAA
 
Protein sequence
MAGRLNQQNQ RVRPNSNEDQ VVQQVKEHFE RTLIEVGGTV AGSVAALEHQ PHNKALNYGE 
VFLRDNVPVM IYLLTQKRYK EVKQFLSVCL DLQSTTYQTR GVFPTSFVEE QGELIADYGQ
RSIGRITSVD ASLWWPILCW LYVKRSGDKN FGTNQKVQRG VQLMLDLVLH PTFEGTPVLF
VQDCSFMIDR PMDVWGAPLE VEVLLYACLR SCIELMELSR KNHVSRLLDQ RLLLTRQWVH
DLRQFLLKHY WVTSKTMQVL RRRPTEQYGD NQHQNEFNVQ PQVVPDWLQD WLENRGGYLI
GNIRTGRPDF RFYSLGNSLA CLFGLLTAPQ QRALFRLVLH NRQHLMAQMP MRICHPPMEG
AEWQNKTGSD PKNWPWSYHN GGHWPSLLWF FGASILLHER RHPEADVLLM GEMRALLEEC
YWSQLNQLPR QKWAEYFDGP TGTWVGQQSR TYQTWTMVGF LLLHHLLRVC PDDVLWLDLD
ELLPTHEV