Gene NATL1_07771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07771 
Symbol 
ID4780841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp710036 
End bp711589 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content33% 
IMG OID640084052 
ProductGTPase SAR1 and related small G proteins 
Protein accessionYP_001014600 
Protein GI124025484 
COG category[R] General function prediction only 
COG ID[COG0486] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.1398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0362439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCC ATAACTCTAG GAAAATAATA TTTTATGTAT TGATAATTTT CATTAGCTTG 
ATAATTATAG GTCTGGTCGG AGCAATAATT AGATTGATAA ATATACCTGC CATATTAATT
ACAGTATTAA TAATATGTGG TTTAAGTTAT ACAAAAAAAA TAGACTGGTT ACAAAATAGT
CTAAGATCAA TATTTAAAAT AAAGGATGAG AAGAAATCAT TAGATTTATC GCTAATTAGC
AAAAAAGAAG CCGCAGATAA ATCATTAAAA AGTATTGATC ATTTAATCAC ATTAATCAAT
GACAAAGTCA AGGCCAAGGC CTTAAAGGAT GAAAAGGATA GGGTTTCATT AGAGTTAGAT
AGAGGAGATA TTATTTTAGT AGTTTTTGGA ATTGGCTCAA GCGGTAAAAC TTCATTAATA
AGAGCCCTAT TAAAAAAAAT AGTAGGTAAA GTTAGTCCTG AAATGGGATC AACGAGAGGG
AAAGAAACCT TCCGACTGAA ACTTAAAGGA CTTACAAGAG GAATAAGAAT AATTGACACT
CCTGGCATAC TGGAATCCGG GAGAGGGGGT AGAGAGAGGG AAAAAAGTGC GTTAATGGAA
GCACGTAAAT CTGATTTAAT GTTAGTAGTA ATTGAAGGTG ATTTACGTTC TGAAGAAACA
AGAACAATTA GGAGTTTGTC AAAATTAGGA AAAAGACTTT TACTTGTGCT AAATAAAATA
GATTTAAGAG GAGAAAGTGA AGAAAAAAGA TTAATTGAGA TACTAAATTC TAGATGTAAT
GATTTTATTG GTCCAAATGA CATTATTTGT ACATCAGCAT CACCTCAGAC AATTGCAGTC
ACTGGCAGAA AGCCTTATCA ACCAGCCCCT GAAATCAATA GTTTAATTAG AAGATTAGCA
AATATACTTC ATGAAGAAGG TGAAGAATTA ATTGCGGATA ATATTTTACT TCAATGCAGC
AATATTGGAA AAGAAGGGAA AAATTTATTA ATCAAACAAA GAACTCAATC TGCTAAAAAA
TGTATAGATA AGTATGGGTG GCTCAGCAGC GGTGCATTAA TACTAACTCC AGTTCCTGTC
TTAGACATGA TCGCTGCAGC GGCTGTAAAT GCACAAATGG TAATAGAAAT AGCTAAAATA
CATGGAGTTA AACTTACAAA TGAAAGGGCA AAGAATTTAG CGCTTTCGGT AGGAAAAATA
CTTGCAACTA TGGGTATAGT TAAAGGTGGA GTTTCTCTAA TAAGTTCAAC ATTAAGTTTA
TCACTACCAA CATTAGTTAT TAGCAAAGTA ATTCAAGGTA TTAGTGTATC TTGGCTTACT
AGGATTGCTG GAGCAAGTTT CATTACTTAT TTCCAACAAG ATCAAGACTG GGGAGATGGA
GGAATACAAG AAGTTGTTGA ATATCACTAC AACTTAAACA AAAGGGAGGA ATATTTTAAA
AGTTTTATTC GGAGAGCTTA TGAGAGAGTT ATTGATCCGC TAGTTGAAAA GAATTTGAAA
AAGCTACCAC CGAGATCAAG GCCTCCGAAG GAGGGGGACT CATCGGTCCT CTAA
 
Protein sequence
MKIHNSRKII FYVLIIFISL IIIGLVGAII RLINIPAILI TVLIICGLSY TKKIDWLQNS 
LRSIFKIKDE KKSLDLSLIS KKEAADKSLK SIDHLITLIN DKVKAKALKD EKDRVSLELD
RGDIILVVFG IGSSGKTSLI RALLKKIVGK VSPEMGSTRG KETFRLKLKG LTRGIRIIDT
PGILESGRGG REREKSALME ARKSDLMLVV IEGDLRSEET RTIRSLSKLG KRLLLVLNKI
DLRGESEEKR LIEILNSRCN DFIGPNDIIC TSASPQTIAV TGRKPYQPAP EINSLIRRLA
NILHEEGEEL IADNILLQCS NIGKEGKNLL IKQRTQSAKK CIDKYGWLSS GALILTPVPV
LDMIAAAAVN AQMVIEIAKI HGVKLTNERA KNLALSVGKI LATMGIVKGG VSLISSTLSL
SLPTLVISKV IQGISVSWLT RIAGASFITY FQQDQDWGDG GIQEVVEYHY NLNKREEYFK
SFIRRAYERV IDPLVEKNLK KLPPRSRPPK EGDSSVL