Gene SAG1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1923 
SymbolgalE 
ID1014733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1906951 
End bp1907946 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content38% 
IMG OID637317091 
ProductUDP-glucose 4-epimerase 
Protein accessionNP_688912 
Protein GI22538061 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTTC TGATTTTAGG TGGCGCCGGC TACATTGGCT CGCATATGGT TGATCAGTTA 
ATCACACAAG GAAAAGAGAA AGTTATTGTT GTTGATAACT TGGTAACAGG TCATCGTCAA
GCGGTGCATT CAGATGCGAT TTTTTATGAA GGAGATTTAT CCGATAAAAC GTTTATGCGT
CAAGTTTTTA GGGAAAATCC TGATGTAGAT GCTGTTATCC ATTTTGCGGC ATTTTCACTA
GTTGCTGAGT CAATGGAAAA TCCTTTAAAA TATTTTGATA ATAACACTGC TGGAATGATA
AAACTTTTAG AAGTAATGAA TGAATGTGAT ATCAAAAATA TTGTCTTTTC ATCAACAGCT
GCAACATATG GTATTCCAGA ACAAGTTCCT ATTTTAGAAA CAGCTCCTCA AAATCCTATT
AATCCTTATG GCGAAAGTAA GCTTATGATG GAAACAATTA TGAAATGGGC TGATCAAGCC
TACGGTATTA AGTTTGTCGC TCTACGTTAC TTCAACGTTG CTGGAGATAA ACCGGATGGC
TCAATTGGGG AAGATCATAA ACCAGAAACA CATTTGTTAC CAATCATTCT TCAAGTTGCT
CAAGGAGTAC GTGACAAAAT AATGATTTTT GGAGATGATT ACAATACTCC AGATGGAACT
AATGTTCGAG ATTATGTGCA TCCATTTGAT TTGGCAGATG CTCATATATT AGCAGTTGAT
TACCTTCGCC AAGGGAATGA ATCAAACGTG TTTAATCTCG GATCTTCGAC AGGTTTTTCT
AACCTTCAGA TGTTAGAGGC AGCTCGTCGT ATTACTGGGA AAGAAATTCC TGCTCAAAAG
GCAGCTCGTC GTCCAGGAGA TCCAGATACG CTTATTGCTT CCTCAGAGAA AGCTCGTCAA
ATCCTTGGGT GGGAGCCTAA ATTTGATAAT ATTGATAAAA TTATTTCATC GGCATGGGCA
TGGCATTCTA GTCATCCAAA TGGCTACGAA GATTAA
 
Protein sequence
MAVLILGGAG YIGSHMVDQL ITQGKEKVIV VDNLVTGHRQ AVHSDAIFYE GDLSDKTFMR 
QVFRENPDVD AVIHFAAFSL VAESMENPLK YFDNNTAGMI KLLEVMNECD IKNIVFSSTA
ATYGIPEQVP ILETAPQNPI NPYGESKLMM ETIMKWADQA YGIKFVALRY FNVAGDKPDG
SIGEDHKPET HLLPIILQVA QGVRDKIMIF GDDYNTPDGT NVRDYVHPFD LADAHILAVD
YLRQGNESNV FNLGSSTGFS NLQMLEAARR ITGKEIPAQK AARRPGDPDT LIASSEKARQ
ILGWEPKFDN IDKIISSAWA WHSSHPNGYE D