Gene P9303_16501 details


Gene Information

Locus tag: P9303_16501
Symbol:
ID: 4777445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1441515
End bp: 1443113
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 57%
IMG OID: 640087159
Product: GTPase SAR1 and related small G proteins
Protein accession: YP_001017659
Protein GI: 124023352
COG category: [R] General function prediction only
COG ID: [COG1100] GTPase SAR1 and related small G proteins
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
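
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1599 bp) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1441515
end_bp = 1443113

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the last codon is assumed to be a stop
# and therefore contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1599 532
```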


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.569133
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCCC GACAGCGGCT GCTCCTATGG ACCGCCTGCG CGCTGGTGGT CTTGCTCGTC 
ATCGGAGCTC TAGTGCAAGT GGTGAGGAAC CTGCTCTGGG ATTTGAGTTA CTTACTACCT
CCCTGGTTGC TAGGCCCGGT GTTTTTCCTA ACAGCAGGCC TCATCATCTT GATGATCTAT
CAAGTTGGAT GGCCCTGGTG GAAGGCTTTT AAGCGTCAGA ACTTAGAAAC TGCTCAAAAC
AATCAACGGC CCCTTTCCCC TCCAAGTAGC CGTCATCAAG CAGCCAAGCA AAGTCTTGAA
AGCATTGATC GCCTGCTCGA ACGCCTCCAA GACGATGTCA CTCGAGAAGG GCTTAAGCAA
GAGAGAGAAA GGGTGGCTGA TGAACTGGCC CGTGGTGATC TGATGGTGGT GGTGTTCGGT
ACTGGTTCCA GCGGCAAGAC ATCTCTGATC CGAGCACTCC TGAACGAAAT GGTGGGTGAG
GTTGGTGCAC CCATGGGATC CACAACCAGT AGCCAGATTT ATCGACTGCG TTTAAAGGGA
CTTGATCGAG GCCTTCAACT GGCTGACACC CCAGGAATTC TTGAAGCAGG CAGAGCCGGT
TTAAGCCGAG AGAAGGAAGC AAGGCAACGA GCCAGTAGAG CTGACCTGAT GGTGGTGGTT
GTGGACTGTG ATCTACGCGC TTCAGAGCTG GAGGTCATCA GTAGCCTCGC CAATCTCGGC
AAACGATTGC TCCTGGTTCT GAATAAATGC GATCTACGCG GTGAAGAAGA GGAGCGACGG
CTTTTGGCGC AGTTGCGAGG GCGATGCAAG GGCTTGCTTG AAGCTGAGGA TGTGATCTCC
TGTAGCGCTG CACCCCAGTC AGTGCCGCGT CCCGGCAAAC GACCTTTGCA GCCTCCGGCT
GAGGTCGACA ACCTGCTGCG TCGCCTTGCG TCAGTGCTAC ACGCCGATGG TGAAGAACTA
CTGGCAGACA ACATTCTGCT GCAATGTCGC CATCTAGGAG ATGCCGGTCG CCAGCTGCTG
GATCGACAAC GACAACATGA AGCGCGTCAG TGTGTTGATC GTTACAGCTG GATCAGTGGT
GGTGTCGTCG CTGCAACCCC CCTCCCAGGA GTGGATCTAT TGGGGACGGC AGCGGTGAAT
GCCCAAATGG TGATGGAGGT CGCCCGGGTC TATGGAGTTC AACTCACTCG CAACCGAGCA
CAAGAACTGG CGGTATCAGT AGGCCGCACT TTGGCAGGAC TCGGCATTGT TAAAGGTGGA
GTGGCGATCA TCGGCACAGC TCTCAGTGTC AACTTGCCCA CCCTTTTGCT GGGCCGAGCG
GTACAAGGGG TCGCTGCTGC TTGGCTCACA CGCGTTGCGG GAGCGAGCTT CATGACCTAC
TTCCAGCAGG ATCAAGACTG GGGGGATGGC GGCATGCAGG AAGTGGTTCA ACGTCACTAC
GATCTCAACC GACGAGAATC TTCGCTGGAA CGTTTTCTCA CGACAGCCCT GCGGCGGGTG
GTGGAGCCTC TACAGCGGGA GAAACGGCGA CAGCTCCCGC CACGCCCAGG GCCTCGGGAG
GTGGCGGACG CATCGGACCA CGGGCATCCA GAACTGTGA
 
Protein sequence
MISRQRLLLW TACALVVLLV IGALVQVVRN LLWDLSYLLP PWLLGPVFFL TAGLIILMIY 
QVGWPWWKAF KRQNLETAQN NQRPLSPPSS RHQAAKQSLE SIDRLLERLQ DDVTREGLKQ
ERERVADELA RGDLMVVVFG TGSSGKTSLI RALLNEMVGE VGAPMGSTTS SQIYRLRLKG
LDRGLQLADT PGILEAGRAG LSREKEARQR ASRADLMVVV VDCDLRASEL EVISSLANLG
KRLLLVLNKC DLRGEEEERR LLAQLRGRCK GLLEAEDVIS CSAAPQSVPR PGKRPLQPPA
EVDNLLRRLA SVLHADGEEL LADNILLQCR HLGDAGRQLL DRQRQHEARQ CVDRYSWISG
GVVAATPLPG VDLLGTAAVN AQMVMEVARV YGVQLTRNRA QELAVSVGRT LAGLGIVKGG
VAIIGTALSV NLPTLLLGRA VQGVAAAWLT RVAGASFMTY FQQDQDWGDG GMQEVVQRHY
DLNRRESSLE RFLTTALRRV VEPLQREKRR QLPPRPGPRE VADASDHGHP EL
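
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own tooling: `clean` and `translate` are hypothetical helper names, and the codon table is built from the standard amino acid assignments, which table 11 shares with the standard code. Table 11 also permits alternative start codons, which are read as M at initiation; that case is not handled here because this gene begins with ATG. The two inputs are the first 30 nt and the final line of the gene sequence, copied from this record; the final line begins in frame because the 1560 nt preceding it are a multiple of 3.

```python
# Codon table in the conventional TCAG order; amino acid assignments are
# those of the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-nt block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_block = "ATGATCTCCC GACAGCGGCT GCTCCTATGG"
last_line = "GTGGCGGACG CATCGGACCA CGGGCATCCA GAACTGTGA"

print(translate(first_block))  # MISRQRLLLW
print(translate(last_line))    # VADASDHGHPEL
```

Both outputs match the start and end of the protein sequence listed above; the trailing TGA is the stop codon and yields no residue.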