Gene P9303_17441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_17441 
Symbol 
ID4778113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1524788 
End bp1526464 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content54% 
IMG OID640087251 
Producthypothetical protein 
Protein accessionYP_001017751 
Protein GI124023444 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGAATT TGGCGGACAG CACATCCACT CAGATCCTGT TGCTGGCTCC CGATCTGCTC 
GGTGAATCCT TGGCGTTGCA GCTCAGCAGT GCAAACCCAA ACCTGGATGT CATTCTGCGG
ACGGACCAGC TGAGCCGTCA TCCAGTCCTA GTGATCTTGT CGGTGGAGAG TCTCGAAACT
CTCAGCACAT TGCAACTGGA ACTGAAAAGA CTTCAGGAGC ATTGGCAACC TGCGCCAGTA
ATGCTGATCC TTCCGGCCCA GCTTCGTTTC AATGCCAACG AGTTGTTGAG CCTCGATTGT
CCAGGCCTAC TTCAAGACCC TGATCTGGCC ACATTGCAGG ATGCCATCAC CACCCTTTGT
GCCGGGGGCA GAGTTGTGAG ACTCAATGCT GCCCCCGTCT CCCAAGACAG CATTCCCCAA
GCAACGATGG GACTTGGTCA GTGGCTGTTG GTGAGTGGTC TCCAGCAGAT CGACAACGAT
CTCAGGTTGA TCGAAGCATT CCTCAACCCA CCGCCCCAAA ATGAACTGTT TCGTCTTTTG
ATGGAAGGGC GCCAGCGGGA ACTCCGCAGT GCAAGAGATT TTTTGTTATG GATTTGGGGC
CCGCTGCAAG TGGGACTACG CAACCCTTTC CCCCCAAACC GGCCAGCCCA AAAGGCACGC
ATCAATTTCG ATTTCGATGC TTCAAGACAG ATCTCAGCAG AAGCTGCTGG AACGGTGATC
TGCCTCACTG AACGGAACGC AGTGGCGGTA TGGGGAGCGA TCCGTCAGCG GCTTAGCGAT
TCCGTAGAAA GAGGACTCAG AAATTCAACA GGCAGCCTGC TAGCGATTGA AAGCCTCAAT
CCTGAGCGAC GTCGTGATCT ACTCCTTGCC CTTTTGAACC AATTGGATCA AGTGATGAAA
AGGCTGCGTC AAGCCAACAG TACCGAGACC CCACTCAATG ACTCCTGGCT GACGCTCGAA
TCTGAACTAC GTGAACAAGC CCTGCGATCC ATGGCAGGGA ATTACGTCAG ACTTCCTCGA
GGTGGTGAAC TCAAGCCAGT GGCAGATCAA TTGCTTGCCA CTGCCGATCT CAAGGGAATT
GATCAAGAAC TTCCAGATCC CCAGAGAATG CTGGCTCCAT TGCTGCTCGA CAGACCCGTG
CTTGTTGAGG GGCAGCTGCT GCCGGCAGAT GCCCCTCGGG CCTTACTGCA ACTCGAAATG
TTGGTGGGTA ACTGGCTCGT ACGAACCGCT GAAATAATCA GTGCCGAAGT TCTCGGAACC
TGTGGTGAGT GGCCCGAACT ACGCCGATTC CTACTTAACC AGCACTTGAT CTCCACACGA
GAACTCGAGC GATTGCGTAA TCAACTCAAT AGTCAGGCCC GTTGGCAAAA CTGGATTCAA
AGACCAATCC AGCTCTACGA AAGTAAACGA CTCCTCTACA GGTTGCGAGA CGGCATTATC
GAGCCATTAC TGCTCACCGA ACCTCGTGAT GAAGAACTCA GCCAGCTTGG TTGGTGGCAG
CAGCAAGTTG CTCTATTGCT CGAAGCCCGC GATGCTCTGG CACCCTCAAT GCAATCCCTG
ATCAAACGCA TTGGTGATCT CATGGTGGTC GTGCTTACTC AGGTATTAGG CCGTGCTATT
GGTCTGGTTG GACGAGGAAT CGCGCAGGGA ATGGGACGCA GCCTGAGAGG TGGCTAA
 
Protein sequence
MVNLADSTST QILLLAPDLL GESLALQLSS ANPNLDVILR TDQLSRHPVL VILSVESLET 
LSTLQLELKR LQEHWQPAPV MLILPAQLRF NANELLSLDC PGLLQDPDLA TLQDAITTLC
AGGRVVRLNA APVSQDSIPQ ATMGLGQWLL VSGLQQIDND LRLIEAFLNP PPQNELFRLL
MEGRQRELRS ARDFLLWIWG PLQVGLRNPF PPNRPAQKAR INFDFDASRQ ISAEAAGTVI
CLTERNAVAV WGAIRQRLSD SVERGLRNST GSLLAIESLN PERRRDLLLA LLNQLDQVMK
RLRQANSTET PLNDSWLTLE SELREQALRS MAGNYVRLPR GGELKPVADQ LLATADLKGI
DQELPDPQRM LAPLLLDRPV LVEGQLLPAD APRALLQLEM LVGNWLVRTA EIISAEVLGT
CGEWPELRRF LLNQHLISTR ELERLRNQLN SQARWQNWIQ RPIQLYESKR LLYRLRDGII
EPLLLTEPRD EELSQLGWWQ QQVALLLEAR DALAPSMQSL IKRIGDLMVV VLTQVLGRAI
GLVGRGIAQG MGRSLRGG