Gene NATL1_03941 details

Gene Information

Locus tag: NATL1_03941
Symbol:
ID: 4780474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 364132
End bp: 365088
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 31%
IMG OID: 640083662
Product: nucleoside-diphosphate-sugar epimerases
Protein accession: YP_001014223
Protein GI: 124025107
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
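
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon; the function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(364132, 365088)
print(length)                  # 957
print(protein_length(length))  # 318
```

Both results agree with the Gene Length (957 bp) and Protein Length (318 aa) fields of the record.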


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.873071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0436557
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAAT CTCCTGTAAA AAACTTGGTT ACTGGAGGGG CTGGCTTCGT TGGTTCTCAT 
TTGATTGATC GTTTAATGAA ATCTGGAGAA AAAGTTATAT GTTTGGATAA TTTTTTTACT
GGGAGTAAAG AAAATATTGA ACACTGGATT GGACATCCAT CTTTTGAGCT TATAGATCAT
GATGTTATAG AGCCAATCAA GCTTGATGTG GATAGGATTT GGCATTTAGC TTGTCCAGCA
TCTCCAATTC ATTATCAATT TAACCCTATT AAAACAGCGA AAACGAGTTT TTTGGGGACT
TATAATATGC TTGGATTAGC TAGGAAAGTT GGAGCTCGAA TATTATTAGC AAGTACTAGT
GAAGTTTATG GAAATCCCGA AATTCATCCT CAGCCTGAAA AATATAACGG CAATGTAAAT
CCTATAGGAA TTCGTAGTTG CTACGATGAG GGTAAACGTG TTGCGGAATC ATTGTGTTAT
GACTATATGA GAATGCATGG TTTAGAAATA AGAATTGCTA GAATATTTAA TACCTATGGT
CCTAGAATGT TATTAAATGA TGGAAGACTT ATTAGCAACT TATTAGTTCA ATCAATACAT
GGAAATGACT TGACTATTTA TGGCAATGGT AAGCAAACTA GAAGCTTTTG TTTTGTTGAT
GACTTAATAG ATGGTTTAAC TTTATTCATG AATTCTTTAA ATGTAGGACC TATGAATTTA
GGCAATCCTG AAGAATTATC TATTCTTCAA ATAACTAACT TCATAAGAAA TATCTCAATT
GAAAAAGTAA ATCTGAAATT TTTAAAAGCA CTAGATGATG ATCCTTTAAG AAGAAAGCCT
GATATTTATC TTGCAAAAAA AGAATTAAAT TGGGAGCCTA AAATAATGTT TAAAGAAGGA
TTAGCAATTA CAAGAAAGTA TTTTGAAAAG AAATTAATCT TTGAAAAAAG TAAATAA
 
Protein sequence
MPKSPVKNLV TGGAGFVGSH LIDRLMKSGE KVICLDNFFT GSKENIEHWI GHPSFELIDH 
DVIEPIKLDV DRIWHLACPA SPIHYQFNPI KTAKTSFLGT YNMLGLARKV GARILLASTS
EVYGNPEIHP QPEKYNGNVN PIGIRSCYDE GKRVAESLCY DYMRMHGLEI RIARIFNTYG
PRMLLNDGRL ISNLLVQSIH GNDLTIYGNG KQTRSFCFVD DLIDGLTLFM NSLNVGPMNL
GNPEELSILQ ITNFIRNISI EKVNLKFLKA LDDDPLRRKP DIYLAKKELN WEPKIMFKEG
LAITRKYFEK KLIFEKSK
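
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of any published tool) clean such a block and compute the percentage of G and C bases, the quantity reported in the GC content field. Only the first line of the gene sequence is used here as a short demonstration; the full sequence can be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace from a sequence printed in space-separated blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGCCCAAAT CTCCTGTAAA AAACTTGGTT ACTGGAGGGG CTGGCTTCGT TGGTTCTCAT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base fragment
```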