Gene NATL1_18431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18431 
Symbol 
ID4780604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1505182 
End bp1506504 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content33% 
IMG OID640085132 
ProductGTPase SAR1 and related small G proteins 
Protein accessionYP_001015663 
Protein GI124026548 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.491382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTACC CAAGAGATAT AGCTACAAAA TGCAAGTTTT TACTTGGTCA ATGGAAAGAG 
AACTTAAATC TCACTAATTA CGAAAGAACA AAATTTGAAG ACACTTTAAA TCAACTTGAT
TTTCAAATAA ATAAATTAGA GAAAAAAGAA CTACAGATAT CAGTGCATGG CAGAGTAGGA
GTTGGTAAAT CAAGCTTATT AAATGCATTA ATTGAAAAGC AAATATTTCC AACTGATATA
ATTAACGGTA ATACAAAAAC CAGTAAATCT TATAAATGGG ACGAAAGGTT TCAAGGATTA
AATAAGGTTG ATCTAATCGA CTCTCCTGGC ATAGATGAAA TAAATAATTC TAATAAAGAA
GAAATAAATT TTAATACTGT CCTAGACACA GATTTAATTC TTTATGTAAT TGATAGTGAT
ATAACGAGAG TCGACATGAA CTCCATTGAA GATCTATTAA GGCATAACAA ACCAATACTA
ATAGTCTTAA ATCGTTGTGA TCAATGGAAT AGAAGAGAAA CAAAACTAAT ACTCTCAAGT
GTTCATAGGA AATTATCATT TTGTAAACAA AAGGTTAAAA TTGCTCTAGT ATCTTCATCT
CCAAGGAAAG CAAAAATAAA ACCAGACGGA ACTATTAGGA GTGAGAAAAC AATCCCTAAA
GTTGGTATTC TCAAGAATGA ACTTAAAGAT ATTATCGACA AAAGTGGTGA ATTTTTTCTT
TGTATAAATA CTTTAAGAAT TGCAGACCGA CTCTACAACT TACTCAAAGA GAATCGACTA
CTGAAAAAGA AAAAAGAAGC ACAAAATTTA ATCGGCAGAT ATGCAACTTT AAAAGCCTCA
GGGGTAGCAC TTAATCCCTT CTTAATGATT GATCTTATTA CCGGTCTAGC TTTTGATAGT
TCTCTTATTA TTCAACTAAG TAAATTATAT GGGTTAGAAG TAGGTGGCCC CACCGCAAGG
CAATTAGTAA AAAAGCTTAG TTTCCAAAAT TCATTACTAG GGGGTGCGCA GATAGGAATA
CAAATTACCT TAAATATTCT CAAGCAAATA ATGATATTTG CAGCACCTCT TACTGGAGGA
TTAAGCCTTG CGCCCACTGC TCCTATAGCC ATTGCTCAAG CTGCTCTTGC TATTCATGCG
ACAAAACTTA TAGGTCGCCT CGCAGCTTAT AAATTTCTAA TTGGGACAAG TAGGAACGAT
GGCAGGCCTC GATTAATGTT GAACTATCTT CTCAAAAACA ACTCAGACTT TAGAATAATG
ATTGGTGACT TTAAATTTCT TACATCAAGT ACGGAAAAAA ATAAAAATTA TTTGTTGCCA
TGA
 
Protein sequence
MIYPRDIATK CKFLLGQWKE NLNLTNYERT KFEDTLNQLD FQINKLEKKE LQISVHGRVG 
VGKSSLLNAL IEKQIFPTDI INGNTKTSKS YKWDERFQGL NKVDLIDSPG IDEINNSNKE
EINFNTVLDT DLILYVIDSD ITRVDMNSIE DLLRHNKPIL IVLNRCDQWN RRETKLILSS
VHRKLSFCKQ KVKIALVSSS PRKAKIKPDG TIRSEKTIPK VGILKNELKD IIDKSGEFFL
CINTLRIADR LYNLLKENRL LKKKKEAQNL IGRYATLKAS GVALNPFLMI DLITGLAFDS
SLIIQLSKLY GLEVGGPTAR QLVKKLSFQN SLLGGAQIGI QITLNILKQI MIFAAPLTGG
LSLAPTAPIA IAQAALAIHA TKLIGRLAAY KFLIGTSRND GRPRLMLNYL LKNNSDFRIM
IGDFKFLTSS TEKNKNYLLP