Gene NATL1_08591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08591 
Symbol 
ID4781273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp788809 
End bp790689 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content30% 
IMG OID640084134 
Productnucleotide-diphosphate-sugar epimerase, membrane associated 
Protein accessionYP_001014682 
Protein GI124025566 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000225321 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTGCCA GCGCTGCTAA TAAAATTCTC TCTTTATCGT CTAAAAAGCG GCTTTCAATT 
CTTATATTTA TTGATATAGT CATAATTATA TTTTCGAGTA AGTTAGGACT ATATTTAACA
AGTAGAGATT ACTTTAATGA TTCTTACTTT TCTATATTCA TAACACTTAT TACAATATTT
ATAGGTATAA CCTGTTATAT AATAAGTGGA CAATATATTG GTATAACTAG ATATTTAAGC
AGTAGAGATT TAAATCATCT TATAATTAGG AATTTTATAA TTACAGTATT AACAAGAATA
ACTCTTATCT TCTTTCAAGT AGAATTACCA TCCTTAGGAT ATTTTATTTT ATTATGGATT
TTATTATCAA CATCAAGTGT ATATATTAGA TTTTTTATGC GTGATTTTAT CCTAAAAATT
AAGATCTCAA AAAATAAAAA TAGGAAAAAA GTTATTATTT ATGGGGCAGG TGAGGCAGGG
GCTCAACTAG CATCATCCTT AATTCAGGAT GGGCGTTATT GCGTTGAGGG ATTTATCGAT
GATGATTCAA GCCTTTGGCG GAGGAACATA AAAGGTATAC CCATATATCC TCCAAATAGA
ATTTATGAAA ATAAAGCTCA TATTGATCAG GTTCTTTTAG CAATACCTTC CCTAAGAAAA
AAAAAGCGAC TTGAAATTTT ACATACACTC TATAAAAAAG GGGTTTCGGT ACTTCAAATA
CCTTCTATTG ATGAAATAAA GAGTGAAAAG AATCTTATTA CATCATTAAA ACCTGTAAAA
GTTGAAGATA TTCTCGGACG AGAGCCAATT AATCCTGATA ATAATCTTTT AAACATAGCA
GTGAAAGGGC AAACAGTTTG CATAACAGGT GCAGGTGGAT CTATAGGTAG TGAATTATCC
AAACAAATAT ATAATTTAAA CCCCTATAAA ATGATATTAA TAGATCATAG TGAATCTCAT
CTTTATAATA TAAATAAGCA AATTACTTCC TATCCTGATA ATGGTATAGA AGTTAAAGCA
ATTCTAGGAA GTACAACAGA TTTACCATTT ATTAATAAAG TTTTTACTGA TAATAATGTA
GATATAATTT TTCATGCTGC TGCATATAAA CATGTTCCTC TTGTTGAATC AAATCCCTTA
AAAGGCTTAT TTAATAATGT TTTTTCTACT GAAATAGTTT GTAAAGCAGC ATTAGAAGCA
GGAGCTAATA ATTTAGTTCT GATCTCAACA GATAAAGCTG TTCGACCCAC CAATGTAATG
GGTGCCTCCA AGAGGCTTTC AGAATTAGTT GTTCAAGCGA TTGCAGAGAA ATCAAAAGAG
AATTCTATTG CTAAAAAAAC ATGTTTTTCT ATGGTTCGAT TTGGGAATGT ACTTGGATCT
TCTGGTTCAG TTTTACCACT TTTTCAAGAG CAAATTGATA ATGGTGGTCC AATAACTTTG
ACCCATCCAA GAATAATTAG ATATTTTATG ACTATTTCGG AAGCTTCTCA ATTAGTAATT
CAATCAAAGG TCCTTGCAGA GGGGGGGGAT GTATTTCATC TCGATATGGG AAAACCAGTG
AGCATTAAAT CATTAGCAGA GCAATTAATA CTTTTAAATG GTTTATCTAT TAAAGATAAT
AAAAATTTAG AAGGAGATAT AGAAATAAAA TTTACTGGTC TAAGACCAGG AGAAAAATTA
TATGAAGAAT TGATCATAGA TGCAGAATCT AAGAAAACAA TTCATCCTCT TATCTATCGT
GCAGATGAGA GATTTATTCC TTTGGATATA ATTATGCCAA CATTAGAAAT ACTACGAAGA
TATTTAGATA ACGAGGATAA AATCAATAGT CTTTTGATTT TGAAAGAGCT TGTGCCTGAA
TGGCAGACTA ATTTAATTTA A
 
Protein sequence
MLASAANKIL SLSSKKRLSI LIFIDIVIII FSSKLGLYLT SRDYFNDSYF SIFITLITIF 
IGITCYIISG QYIGITRYLS SRDLNHLIIR NFIITVLTRI TLIFFQVELP SLGYFILLWI
LLSTSSVYIR FFMRDFILKI KISKNKNRKK VIIYGAGEAG AQLASSLIQD GRYCVEGFID
DDSSLWRRNI KGIPIYPPNR IYENKAHIDQ VLLAIPSLRK KKRLEILHTL YKKGVSVLQI
PSIDEIKSEK NLITSLKPVK VEDILGREPI NPDNNLLNIA VKGQTVCITG AGGSIGSELS
KQIYNLNPYK MILIDHSESH LYNINKQITS YPDNGIEVKA ILGSTTDLPF INKVFTDNNV
DIIFHAAAYK HVPLVESNPL KGLFNNVFST EIVCKAALEA GANNLVLIST DKAVRPTNVM
GASKRLSELV VQAIAEKSKE NSIAKKTCFS MVRFGNVLGS SGSVLPLFQE QIDNGGPITL
THPRIIRYFM TISEASQLVI QSKVLAEGGD VFHLDMGKPV SIKSLAEQLI LLNGLSIKDN
KNLEGDIEIK FTGLRPGEKL YEELIIDAES KKTIHPLIYR ADERFIPLDI IMPTLEILRR
YLDNEDKINS LLILKELVPE WQTNLI