Gene NATL1_04431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_04431 
SymbolengA 
ID4780821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp405593 
End bp406963 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content41% 
IMG OID640083720 
ProductGTP-binding protein EngA 
Protein accessionYP_001014272 
Protein GI124025156 
COG category[R] General function prediction only 
COG ID[COG1160] Predicted GTPases 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain
[TIGR03594] ribosome-associated GTPase EngA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0422674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCTAC CAGTAGTCGC AATAATTGGA CGCCCAAATG TTGGGAAATC TACATTGGTG 
AATCGCTTAT GTCAGAGCAG AGAAGCCATT GTTCATGATG AGCCGGGGGT AACGAGAGAT
CGAACTTATC AAGATGGATT CTGGAGGGAT AGAGATTTTA AAGTTGTAGA TACTGGAGGG
CTGGTTTTTG ATGACGATAG TGAGTTCCTC CCTGAAATTA GAGAGCAAGC TAATCTTGCG
CTTGAGGAAG CCGTAGTTGC ATTAGTAATT GTTGATGGCC AGGAGGGAAT TACTACCGCT
GATGAATCAA TTGCTGAATT TTTAAGGTCT CGTTCCTGCA AAACCCTCGT GGTGGTTAAT
AAATGTGAAT CTCCCGAACA AGGTTTAGCA ATGGCAGCTG AATTTTGGAA GCTTGGTCTT
GGTGAGCCCT ATCCAATCTC TGCCATACAT GGAGTAGGTA CAGGCGATCT GCTTGATCAG
GTGGTTAATT TGTTTCCCTC TAAAGATTTA GATGAAGTTA GTGATTCTCC TGTTCAATTG
GCAATTATTG GGAGACCAAA TGTAGGTAAG TCCAGTCTTC TTAATTCTAT TTGTGGAGAG
ACAAGGGCAA TTGTTAGCTC TATTAGGGGT ACAACTCGAG ATACGATTGA TACTCGAATT
ACTCATCAGG GTAAGGAATG GAAATTAGTT GATACGGCGG GAATACGTAG ACGTAGAAGT
GTTAATTATG GCCCAGAATT TTTTGGTATT AATCGCAGTT TTAAGGCAAT AGAAAGAAGT
GATGTCTGTG TGTTGGTTAT AGATGCTTTG GATGGCGTCA CAGAACAAGA TCAAAGGCTT
GCAGGTAGAA TTGAGCAGGA AGGAAGAGCT TGTTTGATAG TCATTAATAA ATGGGATGCT
GTAGAAAAAG ATAGTCACAC AATGTCTGCA ATGGAAAAAG ACATTCGTTC AAAATTATAT
TTTCTCGATT GGGCCCAGAT GATCTTTACA TCAGCAGTTA CGGGTCAAAG AGTAGAAGGT
ATTTTTGCAT TAGCTACTTT GGCCGTTGAT CAGAGTAGAA GAAGGGTAAC TACTTCAGTT
GTTAATGAGG TGCTGACTGA GGCCTTAAAA TGGAGAAGTC CTCCTACAAC AAGAGGGGGA
AAACAAGGGC GTCTTTATTA CGGTACTCAA GTAGCTATTA ATCCTCCCAG TTTTACTCTG
TTCGTGAATG AACCTAAATT ATTTGGTGAA ACTTATCGAA GATATATTGA GAGACAAATT
AGAGAGGGTC TTGGTTTTGA AGGGACTCCT ATAAAGTTAT TTTGGAGAGG GAAGCAGCAA
CGCGATGTCG AAAAAGATAT GGCACGCCAA CAGAAAGGGG TCCAAAATTA G
 
Protein sequence
MALPVVAIIG RPNVGKSTLV NRLCQSREAI VHDEPGVTRD RTYQDGFWRD RDFKVVDTGG 
LVFDDDSEFL PEIREQANLA LEEAVVALVI VDGQEGITTA DESIAEFLRS RSCKTLVVVN
KCESPEQGLA MAAEFWKLGL GEPYPISAIH GVGTGDLLDQ VVNLFPSKDL DEVSDSPVQL
AIIGRPNVGK SSLLNSICGE TRAIVSSIRG TTRDTIDTRI THQGKEWKLV DTAGIRRRRS
VNYGPEFFGI NRSFKAIERS DVCVLVIDAL DGVTEQDQRL AGRIEQEGRA CLIVINKWDA
VEKDSHTMSA MEKDIRSKLY FLDWAQMIFT SAVTGQRVEG IFALATLAVD QSRRRVTTSV
VNEVLTEALK WRSPPTTRGG KQGRLYYGTQ VAINPPSFTL FVNEPKLFGE TYRRYIERQI
REGLGFEGTP IKLFWRGKQQ RDVEKDMARQ QKGVQN