Gene A9601_14271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14271 
SymbolgalE 
ID4718148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1199934 
End bp1201001 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content29% 
IMG OID640079148 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_001009818 
Protein GI123968960 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTG TTCTTACCAC AGGAGGACTT GGATATATAG GAAGTCACAC GGTAATTGCA 
CTTATAAATC GGGGTTTTAA TGTTTTGATT ATTGATTCAT TAATAAATTC TAAGTCGGAA
ACGTTTAATA ATATTGAAAA AATTTTATTT AATGAGATGG GTGAAATTAA AGAAAAATTA
TTTTTTAGGA AAGGAGATTT AAGGAACAAA TTATGGCTTG AAAATATTTT TCAGGAATTT
AATGATAAAA AACAACCTAT CGAGGCCGTC ATTCACTTCG CAGGTTTAAA ATCTATAGGA
GAATCTATAT TAAATCCCTT AAATTACTAT GATGTAAATC TCAACACTAC TTTATGTCTC
CTTTCAGTAA TGTCTAAATT TAAATGCTTT AAATTGATAT TTAGTAGTAG CGCAACTGTT
TATAAAATTG ATAAAAATGA AAAGATATCA GAAAATGGAA TCCTTTCACC TCTTAATCCA
TATGGAAATA CAAAATTAAG TAACGAAAAA ATAATCGAAG ACGTTTTTAA AAGCGACGAT
AAAAGATGGA AAATAGCTAA CTTGAGGTAT TTCAATCCTT GTGGAGCTCA TGATTCAGGA
ATAATTGGAG AAAATCCCTT AATAAATCAT TCAAATATAT TTCCTACAAT TTTAAGGGTA
ATTAATAGAG AGATTGAAAA ACTTCCTATT TACGGATCCG ATTGGCCTAC TAAAGATGGG
ACATGTATTA GAGACTATAT TCATGTAATG GATTTAGCAG AAGCTCATTT AGCTGCACTT
ATTTATTTAT ATGAAAATGA GCCGACTTAC CTTAATCTCA ATATTGGAAC GGGTACAGGT
ATAAGTGTAC TAGAACTTAT TAAGACCTTT AGCAATGTAA ATAATTGTCA AATTCCATAT
TACTTTACTG AAAAAAGAAA AGGTGATGCT GCTTTCGTTG TTGCGAATAA TTCTTTAGTT
ATTCAAACTT TAAAGTGGGA ACCTAAGAGA AACCTAAAAG ATATTTGCAA AGACTCATGG
CGTTGGTTTA TCAAAAGTAA AGAAGGAAGT AATTTTAAAA ATAATTGA
 
Protein sequence
MKTVLTTGGL GYIGSHTVIA LINRGFNVLI IDSLINSKSE TFNNIEKILF NEMGEIKEKL 
FFRKGDLRNK LWLENIFQEF NDKKQPIEAV IHFAGLKSIG ESILNPLNYY DVNLNTTLCL
LSVMSKFKCF KLIFSSSATV YKIDKNEKIS ENGILSPLNP YGNTKLSNEK IIEDVFKSDD
KRWKIANLRY FNPCGAHDSG IIGENPLINH SNIFPTILRV INREIEKLPI YGSDWPTKDG
TCIRDYIHVM DLAEAHLAAL IYLYENEPTY LNLNIGTGTG ISVLELIKTF SNVNNCQIPY
YFTEKRKGDA AFVVANNSLV IQTLKWEPKR NLKDICKDSW RWFIKSKEGS NFKNN