Gene NATL1_08581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08581 
SymbolwecB 
ID4780247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp787607 
End bp788719 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content32% 
IMG OID640084133 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001014681 
Protein GI124025565 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000168245 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAAACGTA TATCAGTAAT TATTGGAACA AGGCCTGAGG CTATAAAATT TGGTCCGTTA 
ATTCTTGCTT TTCTAAAAAC CAAAGAAATA GATTTAAGGA TTATTTCAAC AGGTCAACAT
TATGAACTAG TAGATCAAGT AAATGAATTG TTTAAAATTG TGCCCAACAA GAACCTTAAG
ATTATGGTTC CTGGGCAAAG TCTTACAAAG ATAACTAATG AAGTTTTAAT AGGATTAAAA
GAAGACTTTA ATGAATATCC ACCCGATTTA GTATTAGTCC AAGGAGATAC TACTTCAGCC
TTTTCTGCTG CTCTGGCAGC ATTTTATGAA AAAATTCCAA TAGGGCATAT TGAAGCTGGG
TTAAGAACAA ATCAAATTAT GCTTCCATAT CCTGAAGAAG CGAATAGAAG AATTATTTCC
CAAATAGCTT CTATTCATTT TGCTCCTACT AAAATTGCTT TTGAAAATCT AAAAAAAGAA
TCTGTACTTG GTGAAGTTTA TTTAACAGGA AATACTGTTG TTGACAGCTT ATTATTTATA
TCAGAAAAAG CACAAATCCC AAAAATTAAA AATGTAGATT TTATAAAACA AAAAATCATA
TTAGCTACAG TTCACAGACG TGAAAACTGG GGGGCGAATT TAAAACAAAT AGCAAAGGGT
TTAAAAAAGA TTTTGGATGA ACATCTCGAT TATATTCTAA TCCTTCCAAT GCACCCAAAT
AAGTCACTTA GAGAACCATT AGAGGAAATA CTTGGAGTGC ATGAAAGAGC TATATTAACA
GAATCGTTAT CTTACAACTC ACTAGTTGGA ACACTTAAGC ACACTAAATT ATTATTAACT
GACTCTGGAG GCCTACAAGA AGAAGCTCCC ACATTTGGAG TGCCTGTATT AGTCCTAAGA
GATTCAACAG AACGGCCAGA AGCAATAAAA GCTGGAACTG CAAAAATTGT TGGATCAAAC
CCAAATAAGA TTTTCAAAGA AGCTAATAAT CTTTTAACTA ACCAAAAAGA ATATCAAAAG
ATGTCTAAAG CAATCAATCC TTTTGGAGAT GGTAAAGCAA GTGAAAGAAT TGTAAAATAT
TGTATTGAAT TTCTTGAAAG AAATAAGAAA TAA
 
Protein sequence
MKRISVIIGT RPEAIKFGPL ILAFLKTKEI DLRIISTGQH YELVDQVNEL FKIVPNKNLK 
IMVPGQSLTK ITNEVLIGLK EDFNEYPPDL VLVQGDTTSA FSAALAAFYE KIPIGHIEAG
LRTNQIMLPY PEEANRRIIS QIASIHFAPT KIAFENLKKE SVLGEVYLTG NTVVDSLLFI
SEKAQIPKIK NVDFIKQKII LATVHRRENW GANLKQIAKG LKKILDEHLD YILILPMHPN
KSLREPLEEI LGVHERAILT ESLSYNSLVG TLKHTKLLLT DSGGLQEEAP TFGVPVLVLR
DSTERPEAIK AGTAKIVGSN PNKIFKEANN LLTNQKEYQK MSKAINPFGD GKASERIVKY
CIEFLERNKK