Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_16501 |
Symbol | |
ID | 4777445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1441515 |
End bp | 1443113 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087159 |
Product | GTPase SAR1 and related small G proteins |
Protein accession | YP_001017659 |
Protein GI | 124023352 |
COG category | [R] General function prediction only |
COG ID | [COG1100] GTPase SAR1 and related small G proteins |
TIGRFAM ID | [TIGR00231] small GTP-binding protein domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.569133 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCCC GACAGCGGCT GCTCCTATGG ACCGCCTGCG CGCTGGTGGT CTTGCTCGTC ATCGGAGCTC TAGTGCAAGT GGTGAGGAAC CTGCTCTGGG ATTTGAGTTA CTTACTACCT CCCTGGTTGC TAGGCCCGGT GTTTTTCCTA ACAGCAGGCC TCATCATCTT GATGATCTAT CAAGTTGGAT GGCCCTGGTG GAAGGCTTTT AAGCGTCAGA ACTTAGAAAC TGCTCAAAAC AATCAACGGC CCCTTTCCCC TCCAAGTAGC CGTCATCAAG CAGCCAAGCA AAGTCTTGAA AGCATTGATC GCCTGCTCGA ACGCCTCCAA GACGATGTCA CTCGAGAAGG GCTTAAGCAA GAGAGAGAAA GGGTGGCTGA TGAACTGGCC CGTGGTGATC TGATGGTGGT GGTGTTCGGT ACTGGTTCCA GCGGCAAGAC ATCTCTGATC CGAGCACTCC TGAACGAAAT GGTGGGTGAG GTTGGTGCAC CCATGGGATC CACAACCAGT AGCCAGATTT ATCGACTGCG TTTAAAGGGA CTTGATCGAG GCCTTCAACT GGCTGACACC CCAGGAATTC TTGAAGCAGG CAGAGCCGGT TTAAGCCGAG AGAAGGAAGC AAGGCAACGA GCCAGTAGAG CTGACCTGAT GGTGGTGGTT GTGGACTGTG ATCTACGCGC TTCAGAGCTG GAGGTCATCA GTAGCCTCGC CAATCTCGGC AAACGATTGC TCCTGGTTCT GAATAAATGC GATCTACGCG GTGAAGAAGA GGAGCGACGG CTTTTGGCGC AGTTGCGAGG GCGATGCAAG GGCTTGCTTG AAGCTGAGGA TGTGATCTCC TGTAGCGCTG CACCCCAGTC AGTGCCGCGT CCCGGCAAAC GACCTTTGCA GCCTCCGGCT GAGGTCGACA ACCTGCTGCG TCGCCTTGCG TCAGTGCTAC ACGCCGATGG TGAAGAACTA CTGGCAGACA ACATTCTGCT GCAATGTCGC CATCTAGGAG ATGCCGGTCG CCAGCTGCTG GATCGACAAC GACAACATGA AGCGCGTCAG TGTGTTGATC GTTACAGCTG GATCAGTGGT GGTGTCGTCG CTGCAACCCC CCTCCCAGGA GTGGATCTAT TGGGGACGGC AGCGGTGAAT GCCCAAATGG TGATGGAGGT CGCCCGGGTC TATGGAGTTC AACTCACTCG CAACCGAGCA CAAGAACTGG CGGTATCAGT AGGCCGCACT TTGGCAGGAC TCGGCATTGT TAAAGGTGGA GTGGCGATCA TCGGCACAGC TCTCAGTGTC AACTTGCCCA CCCTTTTGCT GGGCCGAGCG GTACAAGGGG TCGCTGCTGC TTGGCTCACA CGCGTTGCGG GAGCGAGCTT CATGACCTAC TTCCAGCAGG ATCAAGACTG GGGGGATGGC GGCATGCAGG AAGTGGTTCA ACGTCACTAC GATCTCAACC GACGAGAATC TTCGCTGGAA CGTTTTCTCA CGACAGCCCT GCGGCGGGTG GTGGAGCCTC TACAGCGGGA GAAACGGCGA CAGCTCCCGC CACGCCCAGG GCCTCGGGAG GTGGCGGACG CATCGGACCA CGGGCATCCA GAACTGTGA
|
Protein sequence | MISRQRLLLW TACALVVLLV IGALVQVVRN LLWDLSYLLP PWLLGPVFFL TAGLIILMIY QVGWPWWKAF KRQNLETAQN NQRPLSPPSS RHQAAKQSLE SIDRLLERLQ DDVTREGLKQ ERERVADELA RGDLMVVVFG TGSSGKTSLI RALLNEMVGE VGAPMGSTTS SQIYRLRLKG LDRGLQLADT PGILEAGRAG LSREKEARQR ASRADLMVVV VDCDLRASEL EVISSLANLG KRLLLVLNKC DLRGEEEERR LLAQLRGRCK GLLEAEDVIS CSAAPQSVPR PGKRPLQPPA EVDNLLRRLA SVLHADGEEL LADNILLQCR HLGDAGRQLL DRQRQHEARQ CVDRYSWISG GVVAATPLPG VDLLGTAAVN AQMVMEVARV YGVQLTRNRA QELAVSVGRT LAGLGIVKGG VAIIGTALSV NLPTLLLGRA VQGVAAAWLT RVAGASFMTY FQQDQDWGDG GMQEVVQRHY DLNRRESSLE RFLTTALRRV VEPLQREKRR QLPPRPGPRE VADASDHGHP EL
|
| |