Gene P9515_09991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_09991 
Symbol 
ID4719147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp892683 
End bp894173 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content24% 
IMG OID640080679 
ProductGTPase SAR1 and related small G proteins 
Protein accessionYP_001011313 
Protein GI123966232 
COG category[R] General function prediction only 
COG ID[COG1100] GTPase SAR1 and related small G proteins 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAGT TTATATTTAA ATACTATAAA TACTTATTTA TTTTAATATT TATTTATTTT 
CTATTATCCT TACTTAGAAA TATTTTTAAT CTATATTTCT TTATGATTTT ACTTATTATT
TTATCGTATT TTTATAATAA AAATAAAAAA GTATTTAAAA AAGTTGTATA TAAGATTATT
TTTAATAATA AGAAAAATAT TCCGTTCAAG AATAATTATG GTGCTGCAAT AAATATTCTT
GAAAGTATAG AAGAAATTAA TAAAAAAATA TCTAATAAAG TAGAGTCAGA GTTATTAATA
TACGAAAAAA ATAAATTAAC AGAACAATTA AAATATGGAG ATTACAATGT TATTTTATTT
GGAGCTGGAT CCTCTGGAAA AACATCAATA GCAAGAGCAT TATTAAAAAA CTTAATTGGA
AAAATTTCGC CAACAATTGG AACCACAAAA AATATTGCAA GTTATAAAAT CAGAATCCCA
ATTCTAAAAA GAAATATAAA TATAATTGAT ACTCCAGGCT TATTTGAGGC ATCAATAGAT
GGAGAAAAAA GAGAAAAATC TACAATAATT GAGGCATCAA AATCAGATCT TATCCTTTTT
GTTTTAGATC AGGACATTAA TAAGTTTGAA CTATATTTAA TTAGAGAATT ACTAGAGTTG
AGAAAAAAAA TAATAATTGT ACTAAATAAA TGTGATTTAA GATCAGAGAA ACAAAATAAT
AATATTAAAG AAAATATTAT TTCAATGACA TCCTCAAAAA AAATAAAAAT CTCAGTAGTA
AAAACCATTG CTTCAAATAA TTTATCTCCA AACCATTCAT TAGGCTCAAT AGATGTTAGT
AATTTATTTA AAGAAGTAAT AGAAACCCTT GATGAGAACG GAGAAGAGTT ATTAGCTGAT
AATATTCTTT TTAGATGTAA TAAATTAGGT CTAATTAGTA AAAAAGTAAT TTCTGAACAA
AGAGAGTCAA GTGCTATAAG GGTAATAAAT AAATATACTT GGATAACCGG AGGAGTAATA
CTGGTTAACC CTTTGCCTGT TGTTGATTTT ATAACAACAA CATCTGTAAA TGTTCAAATG
ATCCTTGAGA TTTCAAAAAT ATATAATGTC AGGCTAAGTA AAATTGAAGC AGTTGATTTA
TCAAAATCAT TAATAACTAC ACTTGCCAAA CTAGGAATAT TAAAAGGGGG TTTGAATGTT
ATTACAACTG CATTATCTTC TAATTTTACT ACCATCTATA TTTCAAAATC AATACAATCT
TTAACCTCAT GTTGGTTAAT AAAAATAGTT GGTTTATCAA TAATTGAATA TTTTAAAAAT
GGACAAAATT GGGGAGATGC CGGGATTCAA GAAGTTATTG ATGATATTTA CAAATTAAAT
AAACGTGAAC AATTTTTAAA TAAATTTATT AAAGAAGCAA TTAATAAAAT CAACATTAAA
GAAGATAATC AATCTCAAAG AAAACTACCG CCATATTTTC AGAAAGATTA G
 
Protein sequence
MYKFIFKYYK YLFILIFIYF LLSLLRNIFN LYFFMILLII LSYFYNKNKK VFKKVVYKII 
FNNKKNIPFK NNYGAAINIL ESIEEINKKI SNKVESELLI YEKNKLTEQL KYGDYNVILF
GAGSSGKTSI ARALLKNLIG KISPTIGTTK NIASYKIRIP ILKRNINIID TPGLFEASID
GEKREKSTII EASKSDLILF VLDQDINKFE LYLIRELLEL RKKIIIVLNK CDLRSEKQNN
NIKENIISMT SSKKIKISVV KTIASNNLSP NHSLGSIDVS NLFKEVIETL DENGEELLAD
NILFRCNKLG LISKKVISEQ RESSAIRVIN KYTWITGGVI LVNPLPVVDF ITTTSVNVQM
ILEISKIYNV RLSKIEAVDL SKSLITTLAK LGILKGGLNV ITTALSSNFT TIYISKSIQS
LTSCWLIKIV GLSIIEYFKN GQNWGDAGIQ EVIDDIYKLN KREQFLNKFI KEAINKINIK
EDNQSQRKLP PYFQKD