Gene NATL1_08731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08731 
Symbol 
ID4779752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp810945 
End bp812066 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content31% 
IMG OID640084148 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001014696 
Protein GI124025580 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR03568] UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.572404 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000175685 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATAAAA AAGTTTTATT ATTTGTGACT GGGACGAGAG CTGATTTTGG TAAGATGGAG 
CCATTGGCAC GAGAAGCATT CAATAATGGA TTCAAAGTTA TTTTCTTCGT AACAGGTATG
CATATGATGA GAGAATATGG TCTAACAAAA GAAGAAGTTC ATAAAAATAA AGATATACAA
ATTTTCGAAT TCAGCAATCA AAAATATGGT GATAAATTAG ACACTATATT ATCTAATACT
GTGAGAGGGT TTTCTAATTA TGTTAAAGAA ATTAATCCTG ATATAGTTAT CATTCATGGA
GACAGAATAG AGGCAATAGC ATGCAGCTTA GTATGTTCAA CTAATAATAT AATAAGTGCA
CATATTGAGG GTGGAGAAGT ATCAGGAACC ATTGATGAGG TTTTTAGACA TTGCAATACC
AAGCTTTGTA CCTTTCACTT GGTTAGCTCT AATGAAGCAA AAAAGAGAGT AAGGCAAATG
GGCGAGCCAG AAAAAAATAT TTTTGTTATT GGTTCACCAG AATTAGATAT CCACGGAAGG
AAGTCTGGAG TAGATTTATT GCAAGTAAAA GAGAGATATA AAATTGATTT TAAGGAATAC
GGAATATGCA TCTTTCATCC TGTAACAACT GAAGAAAATC AAATAAAAAT ACAAGCAGAG
AATCTATTCA AATCATTAAG TATTAGTAAT AGAAACTTTG TAATTATTTT ACCCAATAAT
GATCCAGGAT CAATTTATAT ATGTAATGAG ATTGATAAAC TAAATAGCAA CAACTTTCGA
ATAATACCAT CAATGAGATT CAATTATTTT TCAGAGTTAA TGAAAAACTC ATCCCTAATA
ATAGGTAACT CGAGTTTAGG TGTAAGAGAA GCTCCATTTC TTGGAATCAT GTCGATAAAC
ATAGGAACTA GGCAAAATAA AAGAGCTTTA ACGCAATCGA TATATAATTG CAGTGGTCAA
TCAATCCCTG AAATTGTCGA CGCCATAGGA AAATTTTGGA ATAAAAAAAC CACAAGTCAT
AAAGGCTTTG GGAGTGGCAA CTCGAGAAAA AAGTTTTTAA AATTCATCAA TTCTGATAAG
ATATGGAATC AAAGTACTCA AAAAAGCTTT GAAGAGCTTT AA
 
Protein sequence
MDKKVLLFVT GTRADFGKME PLAREAFNNG FKVIFFVTGM HMMREYGLTK EEVHKNKDIQ 
IFEFSNQKYG DKLDTILSNT VRGFSNYVKE INPDIVIIHG DRIEAIACSL VCSTNNIISA
HIEGGEVSGT IDEVFRHCNT KLCTFHLVSS NEAKKRVRQM GEPEKNIFVI GSPELDIHGR
KSGVDLLQVK ERYKIDFKEY GICIFHPVTT EENQIKIQAE NLFKSLSISN RNFVIILPNN
DPGSIYICNE IDKLNSNNFR IIPSMRFNYF SELMKNSSLI IGNSSLGVRE APFLGIMSIN
IGTRQNKRAL TQSIYNCSGQ SIPEIVDAIG KFWNKKTTSH KGFGSGNSRK KFLKFINSDK
IWNQSTQKSF EEL