Gene NATL1_08641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08641 
SymbolwecC 
ID4781298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp795754 
End bp796965 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content30% 
IMG OID640084139 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_001014687 
Protein GI124025571 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000650417 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAAG TATGTGTCAT TGGTCTAGGG TACATAGGTC TTCCGACTGC TGTTGTCTTA 
GCTAAAGCAG GTCACAGTAC ATTAGGAATT GATATAGACC CACATGTCGT TAAAAATGTA
AACAATGGAG TTACTCATTT TAGAGAGCCT AATTTAAACA GTTCATTAAT ATCAGTTGTA
TCTAATGGAA TGCTACGTGC TTCTGAAAAG ATAAAGGATG CTGATATATT TATAATTGCT
GTTCCTACAC CATTCAGAAA AACAAATAAC AAAAATCCAG TTCCTAATCT AGATTATGTT
ATCTCTGCAA TAAATTCTAT TTTACCTTTT ATTAGGAAAG GAAACAGTAT AGTAATAGAG
TCAACATGTC CAATAGGGAC AACAGCTAAA GTAAAAAATA TTATTATTGA GAATACTAAG
TTAAAAGAAA ATGATTTTAA TCTTGCATAC TGCCCAGAAA GAGTTATACC TGGAAATATT
ATGGTTGAAC TAATAAATAA TGATAGAGTT ATTGGTAGTC ATAATAATCA GTCAAAAGAA
AAGATAATTA ATTTTTATAA AACTTTTTGT AAGGGTAATA TTTTATCTAC AACTGCTGAA
ACAGCTGAAA TGGTTAAACT AACAGAAAAC TCATTTAGAG ATGTAAATAT TGCTTTTGCT
AATGAATTAT CCATTATAAG TGATCATATA AATATTGACG TTATCGAACT GATTAAATTA
GCCAATTTTC ACCCAAGAGT TAACATACTA AAACCTGGTT GTGGTGTAGG AGGACATTGC
ATTGCAGTAG ACCCTTGGTT TATCGCTTCA GAAGTCCCCG AAAAATCCCA ATTAATACAA
ACCGCTAGAA AGATTAATGA CTATAAGCCT CAATGGGTTA TAGAAAAAAT AACTAAAAAG
GCTCGTGAAC TTAAATCTAC TCTTAGAAAA GATCCAATTA TTGGCTGCTT AGGTATTACA
TTTAAACCAA ATGTAGATGA TTTAAGAGAA TCCCCTGCAC TAAAGATAGT TCAAGGTCTT
GATAAAACTC AACTAAAATT GATTGTAGCT GATCCTAATA TTAAAAGTCA TAATGAATTA
AGTATATCTC CACTAAATAA TTTAATTAAT AATAGTGATC TTTATGTTTT CTTAGTGGCT
CATAATGAAT TTAAATATAT TTCATTAAAG AATAGAGAAT ATTTAGATTT TTGCGGAGTA
ATAAATAAAT AA
 
Protein sequence
MAKVCVIGLG YIGLPTAVVL AKAGHSTLGI DIDPHVVKNV NNGVTHFREP NLNSSLISVV 
SNGMLRASEK IKDADIFIIA VPTPFRKTNN KNPVPNLDYV ISAINSILPF IRKGNSIVIE
STCPIGTTAK VKNIIIENTK LKENDFNLAY CPERVIPGNI MVELINNDRV IGSHNNQSKE
KIINFYKTFC KGNILSTTAE TAEMVKLTEN SFRDVNIAFA NELSIISDHI NIDVIELIKL
ANFHPRVNIL KPGCGVGGHC IAVDPWFIAS EVPEKSQLIQ TARKINDYKP QWVIEKITKK
ARELKSTLRK DPIIGCLGIT FKPNVDDLRE SPALKIVQGL DKTQLKLIVA DPNIKSHNEL
SISPLNNLIN NSDLYVFLVA HNEFKYISLK NREYLDFCGV INK