Gene Information |
Locus tag | NATL1_19081 |
Symbol | |
ID | 4779845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1570425 |
End bp | 1571501 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640085198 |
Product | WD-40 repeat-containing G-protein |
Protein accession | YP_001015728 |
Protein GI | 124026613 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
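The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1570425, 1571501)
print(length_bp)                  # 1077, matching "Gene Length"
print(protein_length(length_bp))  # 358, matching "Protein Length"
```

Under those assumptions, 1571501 - 1570425 + 1 gives 1077 bp, and 1077 / 3 = 359 codons, one of which is the stop codon, leaving 358 aa.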
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.352529 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGTA TAGAAGCATT TAGTCCCAAG GGAATGCTTC ATGAAAGTTG GTCTGCTCAA GCTAACGACT ACGCAATTGT CTGCGGCTGG GCACTACAAG GTAAAACTTT TTTAGTAGGT GATGTCGCTG GTGGGCTTTA TGCATTTGAG GGAATATCTG GAAAGCTCAT TTGGCAAATA AAAGACATAC ATAAAGGTGG CTTACTCGCA ATGTCTATAC ATCCAAATGG AAAGACTTTT GCAACTGCTG GCCAAGATGG ACATGTAAAT ATATGGGAAA GCCAAAAGGG TACGTCAACT AAAACTTTGG AACTTGGGAA AGGATGGGTT GAGCACATCA AGTGGTCCCC AGACGGAAAA TTTTTAGCTG TAGTTTTTAC TAAATACGTC TATGTTTTTG ATGATAAAGG TCAAGAACAT TGGCGATCAG ATGAGCATCC CAGCACTGTC AGCGCGATTG CTTGGTCTAA TTCAAATGAA TTAGCAACAG CATGCTATGG CCAAGTCACT TTTTTTGATG TAGTAAACGA CAAGATCAAT CAAAAGTTGG AATGGCGAGG CTCGCTAGTA TCGATGGTGC TTAGTCCAGA TGGAGACATA GTGGCATGCG GCAGCCAAGA TAATTCTGTT CATTTCTGGC GTCGTTCAAC TGATCAAGAT TCAGAGATGA CAGGCTACCC AGGTAAACCA AGCCACCTAG CTTTTGATCA AACCGGCACA GTCCTTGCTA CTGGGGGTAG TGATCGCGTG ACGGTTTGGA GTTTTCAAGG CGATGGTCCT GAGGGAACTG TACCAGGAGA GTTAATGCTT CATACGGAAC CCATTTCATG TCTTGCTTTT TCACACAGCG GGATGCTTCT TTTAGCTTCT GGCGCGAGAG ATGGTTCAGT TTTTTCTTGG TTTCTCCAAA AAGATGGTCA GGGTGATCCA GTTGGTGGTG CATTTGCCGG TGACCTTGTA AGCCAAATCG CTTGGCACCC TGATGACACT GCTTTGGCTG CAATAAATGC AAACGGAGGA ATTACGGTTT GGGAGTTTAA GGTTCGGACG AAAACGTCAG CTCAAGGATT CGGATAA
|
Protein sequence | MPGIEAFSPK GMLHESWSAQ ANDYAIVCGW ALQGKTFLVG DVAGGLYAFE GISGKLIWQI KDIHKGGLLA MSIHPNGKTF ATAGQDGHVN IWESQKGTST KTLELGKGWV EHIKWSPDGK FLAVVFTKYV YVFDDKGQEH WRSDEHPSTV SAIAWSNSNE LATACYGQVT FFDVVNDKIN QKLEWRGSLV SMVLSPDGDI VACGSQDNSV HFWRRSTDQD SEMTGYPGKP SHLAFDQTGT VLATGGSDRV TVWSFQGDGP EGTVPGELML HTEPISCLAF SHSGMLLLAS GARDGSVFSW FLQKDGQGDP VGGAFAGDLV SQIAWHPDDT ALAAINANGG ITVWEFKVRT KTSAQGFG
|
| |
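The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how one might clean such a block, compute a GC fraction to compare against the reported GC content, and run a basic sanity check on the coding sequence. The helper names are hypothetical, and the start-codon set (ATG, GTG, TTG) is a simplification of the start codons permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}    # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}  # simplification: most common bacterial starts only


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length is a multiple of 3, starts with a common start codon, ends with a stop."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same blocked layout as the record (not the full gene).
toy = clean_sequence("ATGCCTGGTA TAGAAGCATT TAA")
print(toy)
print(round(gc_fraction(toy), 2))
print(looks_like_cds(toy))
```

To apply this to the record, pass the full Gene sequence value to `clean_sequence` and compare `gc_fraction` with the GC content field and `len(...)` with the Gene Length field.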