Gene NATL1_17201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17201 
SymbolmurA 
ID4779437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1404691 
End bp1406067 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content38% 
IMG OID640085004 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001015540 
Protein GI124026425 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGTG TGGCGACGAT TAAAAGGAAT ATTATTCCGC CACATCTGGA GGTAAAAGGA 
GGGCGTCCGC TTAGTGGGAT CTTAAAAGTT AGTGGGGCAA AAAATTCTTC GCTAGCTTTA
ATGGCTGCGG CGCTTCTTAC GAAAGAGAAA CTCCTCATTC AAAATGTCCC AGAACTTACG
GACATTGAAG TCATGTCAGA GATTCTCCGT AATTTGGGCG CAAAGTTAAC TAAAACAAAT
AATTCTATCG AAATTAATTC AGAGTCCATT CATAATGTTG AATTACCGTA TGAATTAGTT
CATAGCTTGA GAGCAAGCTT TTTCTGTGTT GGCCCCTTAC TTACAAGACT TGGAGAAGCA
AAAATTCCTT TACCTGGTGG TTGCAATATT GGAGCAAGAC CTGTCGATGA ACATATCAAT
GGTCTAAAAG CTTTAGGAGC GGAAGTTGAA GTTATAAATG ATGTTGTAAA AGCTAAAGTT
TCTACCAAAG ACAAAAGATT ACTTGGAGCG AATATTACTC TTAAATATCC CAGCGTTGGA
GCCACGGAAA CCATCTTGAT GGCTTCTTGC TTAGCTTCAG GCAAAACAAC AATATCGAAT
CCAGCCAGAG AGCCAGAGAT CCAGGATCTC GCAAAAATGC TTAATTCAAT GGGAGCAAAA
GTTTTTGGAG CAGGAACGAA AAGAATCACA ATCCTTGGAG TTGAATCTTT AAGCGGGACT
TCTCATTGTG TTATTCCAGA CAGGATAGAA GCAGGAACTT TTCTTATTGC TGCAGCAATA
ACACGATCGC CTCTCATTAT TGGTCCAGTA ATCCCAAATC ATTTGAGCGC TGTTATTTCA
AAATTAAAAG AATGTGGCTG TTCAATATCT CAACATGGGA ATCATCATTT AAAAATCATT
CCAATAGAGA TTTCAGGAGT TGATATAACA ACAAGTCCAT TCCCTGGCTT CCCAACTGAT
CTTCAGGCTC CATTTATGTC ACTAATGGCC ACCGCTAAGG GTTCAAGCAA AATCAAAGAA
AGAGTTTTTG AGAATAGAAT GCAACACGTT TTGGAACTAA ATAAAATGGG CGCCTGTATT
TATCTAGAAA ACAATACTGC TTATATAAAA GGAGTAAAAG AACTTGTAGG TTCAAATGTA
GAGGGAGGAG ATTTACGTTC TTCTGCTGCC ATTATCCTTG CATGTCTCTC TGCCAAAGGA
AATAGTATTT TCACGGGCCT CGAACACTTA GATAGAGGCT ATGAAAAATT AGAAGAAAAA
TTAACGAATG CAGGTTCTAT TATTTCTAGA AAATTTGATC AAATAACATC TCATAGTTCT
TTCTCTAACA AAATAATTAG TGAAGACAAT ATTGATACTC AAAAAAATGC AGCTTAG
 
Protein sequence
MSSVATIKRN IIPPHLEVKG GRPLSGILKV SGAKNSSLAL MAAALLTKEK LLIQNVPELT 
DIEVMSEILR NLGAKLTKTN NSIEINSESI HNVELPYELV HSLRASFFCV GPLLTRLGEA
KIPLPGGCNI GARPVDEHIN GLKALGAEVE VINDVVKAKV STKDKRLLGA NITLKYPSVG
ATETILMASC LASGKTTISN PAREPEIQDL AKMLNSMGAK VFGAGTKRIT ILGVESLSGT
SHCVIPDRIE AGTFLIAAAI TRSPLIIGPV IPNHLSAVIS KLKECGCSIS QHGNHHLKII
PIEISGVDIT TSPFPGFPTD LQAPFMSLMA TAKGSSKIKE RVFENRMQHV LELNKMGACI
YLENNTAYIK GVKELVGSNV EGGDLRSSAA IILACLSAKG NSIFTGLEHL DRGYEKLEEK
LTNAGSIISR KFDQITSHSS FSNKIISEDN IDTQKNAA