Gene NATL1_07021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07021 
SymbolstpA 
ID4780402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp644483 
End bp645682 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content32% 
IMG OID640083978 
Productputative glucosylglycerolphosphate phosphatase 
Protein accessionYP_001014527 
Protein GI124025411 
COG category 
COG ID 
TIGRFAM ID[TIGR02399] glucosylglycerol 3-phosphatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0547585 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAT TTAAGATGAA AGAAGAAATT ATTAACACTA TAATAAACGA ACAAAATATT 
CTAATAGTTC AGGATCTTGA CGGCGTTTGT ATTCCACTAG TTCAAGACCC ACTTCAAAGA
GAGATCAACA AAGACTATGT GAAGGACGTT TCAAGATTAA GAGAGAAGTT TGCAGTTCTA
ACCTGCGGAG AGCATGAAGG CAAAAGAGGC GTTAATCGCT TAGTAGAAAA GGCACTTGAT
TCGAAAAAAA CTGCAAAAGA AAATGGCTTT TACTTACCTG GCCTTGCAGC GTGTGGAGTT
GAGTATCAAG ATAGATTTAG TAATTTATCC TATCCAGGGC TCAAAGATAA TGAGATTAAC
TTTTTAGCAG AAGTTCCAAA GATGATGAGA TCAATGTTAA CTAATGAATT AAAAAAATTC
TTACCAAACC TTTCGAATGA GAAAAGAAAT AAATTAATTG ATGTGGCTGT ATGTGATACG
CGCTTTACAC CTACTTTAAA TTTCAATGAA ATCTTTAGCT ACGTTAAAAA TGATTTTCAA
CAAGTTAAAG ATTTGCAATT GATTATGGGA AAAATAATGA ATGATTTGCT CGAAGAATCT
AAAAATTTTG GCTTAGATAA TTCTTTTTAT CTGCATTTGA TGCCTAATCT AGGAATAAGA
GATGGCAGAG AAATAATGAA ATATTCTACT CAAAATGAAT TTGGAACAAC AGATATACAG
TTCATTATCA AAGGTGCAAT AAAAGAAGCA GGCCTTTTAT TTCTATTAAA TAAATACATA
GCAAATAAAA CTGGCGTTTA TCCATTCGGT GAAAACTTCA ATGTCAGGAA TGCTCCTAAG
ACGCATGCTC AATTAATAAA GCTATGCAGA GATAAAATAC CGCACGAACA AATGCCACTT
CTAGTAGGTG TTGGCGATAC GGTAACCTCG GTTAAAGATA ATAAAGATAA TTCTTGGTTA
AGAGGTGGAA GTGATCGAGG TTTTTTAACA TTGATCCAAA GGTTGGGAGA ATCATATAAG
AAAGATAATC AAGTTGTATT TGTTAACAGC TGCAACGAGC AGGTATTAAG ACCAAGAATA
AATGGAACTG ATATGCAAGG AATTAGTGAT CCTAATGATG ATTTAAAATT CAATATGATT
ATTAATGATG GACCAAAAGA ATATATTGAG TGGTTTAAAC AATTAGCTAG TAACTTTTAG
 
Protein sequence
MKIFKMKEEI INTIINEQNI LIVQDLDGVC IPLVQDPLQR EINKDYVKDV SRLREKFAVL 
TCGEHEGKRG VNRLVEKALD SKKTAKENGF YLPGLAACGV EYQDRFSNLS YPGLKDNEIN
FLAEVPKMMR SMLTNELKKF LPNLSNEKRN KLIDVAVCDT RFTPTLNFNE IFSYVKNDFQ
QVKDLQLIMG KIMNDLLEES KNFGLDNSFY LHLMPNLGIR DGREIMKYST QNEFGTTDIQ
FIIKGAIKEA GLLFLLNKYI ANKTGVYPFG ENFNVRNAPK THAQLIKLCR DKIPHEQMPL
LVGVGDTVTS VKDNKDNSWL RGGSDRGFLT LIQRLGESYK KDNQVVFVNS CNEQVLRPRI
NGTDMQGISD PNDDLKFNMI INDGPKEYIE WFKQLASNF