Gene NATL1_15861 details


Gene Information

Locus tag: NATL1_15861
Symbol:
ID: 4779541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1292899
End bp: 1293990
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 36%
IMG OID: 640084868
Product: GTP-dependent nucleic acid-binding protein EngD
Protein accession: YP_001015408
Protein GI: 124026292
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0012] Predicted GTPase, probable translation factor
TIGRFAM ID: [TIGR00092] GTP-binding protein YchF
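
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1292899, 1293990)
print(length)                     # 1092
print(protein_length_aa(length))  # 363
```

Both values agree with the Gene Length and Protein Length fields of this record.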


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.100082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAGG CCGGAATTGT TGGACTCCCT AATGTTGGGA AGTCCACCTT GTTCAATGCT 
CTTGTTGCGA ATGCTCAAGC TCAAGCAGCA AACTTTCCAT TTTGTACGAT TGAACCCAAT
GTAGGCTCTG TTTCAGTCCC TGACCAACGT TTGAATTTGC TTGGTGAACT TAGTAATAGT
AAGCAAATTA TTCCAACGAG AATGGAGTTT GTTGATATTG CTGGTCTTGT TAAGGGAGCA
AGTAAAGGGG AGGGACTGGG TAACAAGTTT TTAGCAAATA TAAGGGAAGT GGATGCAATA
GTTCATGTGA TTAGATGTTT TAGAGATGAT GATGTTATTC ATGTTTCTGG ATCAGTAGAC
CCATCAAGAG ATATTGAGAT AATAAATTTA GAATTAGCAT TGTCTGATTT AAATCAAATA
GAAAAACGTA GAACTCGATT AAAAAAACAA ATAAGCACTA TTAAGGAAGC AAAGTTAGAA
GATGATGTAT TGGAAAAATT AAGCGAGGCT CTAGAAAATG AAAATGCAGT TAGGAGTGTT
TCCTTAACTG ATGAAGAAAA GAAATTAATT AAACCATTAG GCTTATTAAC TGAAAAACCA
ATTATTTATG CAACTAATCT TGGGGAAGAT GAACTTGCGA AGGGTAATTC CTTTTCAGAT
GAAGTAAATA CACTCGCAAC GAAAGAAGGG TCTGAATGTG TGAAGATTTC AGCGCAAGTT
GAAGCTGAGT TAATTGAATT GGGGGAGGAG GAAAGAGATG ATTATCTAAA TGGTTTAGGA
GTTGAAGAAG GAGGCTTAAT TAGTCTTATT AAAGCTACAT ATCGATTGTT GGGTTTAAGC
ACTTATTTCA CTACTGGAGA AAAAGAAACT AAAGCTTGGA CTATTTCTGA TGGGATGACA
GCTCCTCAAG CTGCCGGGGT AATACATACA GATTTTGAAA AGGGATTTAT TCGAGCTCAA
ACAATCTCAT ACAAAAAACT ACTTGAAGCA GGATCTTTAG TGGAAGCTCG AAACAAAGGT
TGGCTTAGAA GTGAAGGTAA AGAATATGTA GTTAATGAAG GAGACGTTAT GGAGTTTTTA
TTCAACGTCT AA
 
Protein sequence
MLKAGIVGLP NVGKSTLFNA LVANAQAQAA NFPFCTIEPN VGSVSVPDQR LNLLGELSNS 
KQIIPTRMEF VDIAGLVKGA SKGEGLGNKF LANIREVDAI VHVIRCFRDD DVIHVSGSVD
PSRDIEIINL ELALSDLNQI EKRRTRLKKQ ISTIKEAKLE DDVLEKLSEA LENENAVRSV
SLTDEEKKLI KPLGLLTEKP IIYATNLGED ELAKGNSFSD EVNTLATKEG SECVKISAQV
EAELIELGEE ERDDYLNGLG VEEGGLISLI KATYRLLGLS TYFTTGEKET KAWTISDGMT
APQAAGVIHT DFEKGFIRAQ TISYKKLLEA GSLVEARNKG WLRSEGKEYV VNEGDVMEFL
FNV
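
The gene and protein sequences are printed in blocks of ten characters, so the spaces have to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the GC content could be recomputed and the gene translated. It assumes the amino acid assignments of NCBI translation table 11 (identical to the standard code; the tables differ only in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display line of the gene sequence in this record.
first_line = "ATGCTTAAGG CCGGAATTGT TGGACTCCCT"
print(translate(first_line))  # MLKAGIVGLP
```

The first ten residues match the start of the protein sequence listed above. Pasting the full gene sequence into `translate` and `gc_content` would allow the remaining residues and the reported GC content to be checked the same way.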