Gene A9601_14131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14131 
Symbol 
ID4718134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1184809 
End bp1186371 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content29% 
IMG OID640079134 
Productnucleotide-diphosphate-sugar epimerase, membrane-associated protein 
Protein accessionYP_001009804 
Protein GI123968946 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCTT TAACAATATG GATATTGACT GCTGCATTAG TTTCAAGTTT TTCATTTATT 
GTTCGACTAT TTTTAAAAGA CGTAATATTT TTCCTTAAAA ATAAACTTAA TAATAAACAA
AAAAATATTG TTATTTATGG AGCTGGTGAT GCTGGGAATC AACTTGCGAA TGCATTGTGC
TTGAGTCAAA AATATAAAAT TATAAGTTTT ATAGACGATT CTTCTAATCT TCAAGGTAGG
ACAATAGGAG GAATTCCAAT AAAAAGTCCA AATTATTTAA ATTTTCAAAA TTCAAAAGTT
GATAAAGTTC TTTTGGCAAT ACCTTCTTTG ACAAAAGAGA GAAAAAAAAC TTTATTAGAA
AATTTAGAAA AAAAATCTAT AGGTGTATTA CAAATCCCAT CTATTGATGA ATTAACTAGC
GGCTCGGCAC AAATTGACAC ATTGCGACCA GTTTCTCCTG AGGATTTATT AAGCAGAGAT
ATTGCAACTT ATGAAGATAA TAATCTAGAG GAATTAATAA AAAATAAAGT TGTTTGTATT
TCGGGAGCAG GCGGATCTAT TGGATCTGAA TTATGTAGAC AGATCATTAA ACTTAAACCC
AAAAAATTAA TACTTATTGA AATGAATGAG CATAGTCTTT ATAAAATTAA CTATGAATTG
ACTCAAAAAG AAATTTACGA GATTGAAATT ATTCCAATAC TGGAAAATGC ATCAAACTAT
AAATCTCTCA ATCTCCTATT TAAACAAATT AAAATTAATA TCTTATTTCA CGCAGCTGCT
TATAAACACG TACCCTTAGT AGAAATGAAT CCAATGTCAG GTCTGGCTAA TAATTTTTTA
TCAACAAGTA ATTTATGTAA ATTAGCATTA GAAAATTCAA TAGAGAGAAT AATTTTAATT
TCATCTGATA AAGCAGTAAG GCCTACTAAC TTAATGGGCG TTTCAAAACG ATTGTCAGAA
TTAATTTTTC AGGCATATTC CAAAATTGAT AATAAAAAAG ATGTTAATAA AAAGACTATT
TTTGCGATGG TTAGGTTTGG TAATGTACTT GGTTCTTCGG GATCAGTAGT GCCATTATTT
AATAAACAAA TTACTAAAGG AGGGCCTATA ACTTTAACTC ATCCAGATGT AATAAGATTT
TTTATGACTA TTTCAGAATC AGTACAATTA GTTCTACAAG CCGCCTTATT AGCTAATGGG
GGAGACTTAT TCATACTTGA TATGGGAAAA CCTGTAAAAA TATATGATCT TGCCATGAAA
ATGATTAATT TAAGAGGATT AAAAATTAAA AATAAAGAGA ATCCTGATGG TGATATTGAG
ATTATTTTTA CAGGTCTAAG GCCAGGGGAA AAACTTTTTG AGGAATTATT AATTGATGCC
GATACTGAAT CGACTATAAA TCCATATATT CTTCGAGCAC AAGAAAAATT TATTATGCCA
GAAAATCTTT TCCCTAGATT AGAAAAATTG GAATATCTTA TTGACTCAAG GGATTCAAAA
GAAGTTTGGA ATCTCTTGAA TGAAATTGTT CCTGAATGGA TTAGAAGTAA AGAACTAAAT
TAA
 
Protein sequence
MPALTIWILT AALVSSFSFI VRLFLKDVIF FLKNKLNNKQ KNIVIYGAGD AGNQLANALC 
LSQKYKIISF IDDSSNLQGR TIGGIPIKSP NYLNFQNSKV DKVLLAIPSL TKERKKTLLE
NLEKKSIGVL QIPSIDELTS GSAQIDTLRP VSPEDLLSRD IATYEDNNLE ELIKNKVVCI
SGAGGSIGSE LCRQIIKLKP KKLILIEMNE HSLYKINYEL TQKEIYEIEI IPILENASNY
KSLNLLFKQI KINILFHAAA YKHVPLVEMN PMSGLANNFL STSNLCKLAL ENSIERIILI
SSDKAVRPTN LMGVSKRLSE LIFQAYSKID NKKDVNKKTI FAMVRFGNVL GSSGSVVPLF
NKQITKGGPI TLTHPDVIRF FMTISESVQL VLQAALLANG GDLFILDMGK PVKIYDLAMK
MINLRGLKIK NKENPDGDIE IIFTGLRPGE KLFEELLIDA DTESTINPYI LRAQEKFIMP
ENLFPRLEKL EYLIDSRDSK EVWNLLNEIV PEWIRSKELN