Gene Emin_1520 details

Gene Information

Locus tag: Emin_1520
Symbol:
ID: 6263585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1611523
End bp: 1612461
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 41%
IMG OID: 642612007
Product: ADP-L-glycero-D-manno-heptose-6-epimerase
Protein accession: YP_001876404
Protein GI: 187251922
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02197] ADP-L-glycero-D-manno-heptose-6-epimerase
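
The start and end coordinates, the gene length and the protein length listed above are mutually consistent. The short Python sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
# Consistency check of the coordinates and lengths in the record.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the protein length does not count the stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1

def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS of length_bp bases ending in a stop codon."""
    codons, remainder = divmod(length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon

length = gene_length_bp(1611523, 1612461)
print(length, protein_length_aa(length))  # 939 312
```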


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00376715
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 3.69674e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATAAAA AAAGATATTT AGTAACAGGC GGCGCGGGGT TTATCGGAAG CAATATCGCT 
TTTGAATTAC AAAACCAGGG CCATGAAGTA ACAATAATGG ACGATTTTTC CTCAGGCAAT
TTTAAAAACC TTTTAGGTTT TAAAGGGGAT GTAACAGCGG CCGATGTTTT TAAATTCATG
CCGGAAGATG TTTACTTTGA CGCTATTTTC CATGAAGCAG CCATTACGGA CACAACTATC
CATGACCAAA AATTAATGAT GGAAATGAAT GTTGAGGCGT TTAAAAACGT TCTTCATTTC
GCGGCCAGCA ATGAAATTAA AAGGGTTGTT TATGCCTCTT CCGCGGGCAC ATACGGACAA
AACCCCTGCC CTATGACGGA GACGCAGGTT CCCATGCCGG AAAACGTTTA CGGCTTTTCC
AAAGCTGTTA TGGATAATGT CGCGCGCGAG TTTGCCTCGG ACCACCAGGA TATGGTTATT
GTAGGCCTTC GCTATTTTAA TGTTTACGGC CCCGGTGAAT ATTACAAAGG ACACACAGCA
AGCATGATAT ACCAGCTTTA TAATCAAATG AAAGCGGGTA AAAACCCAAA AATCTTTAAA
ATGGGTGAAC AACAAAGAGA TTTCGTTTAC ATTAAAGATG TTGTAAAAGC TAACCTTTGC
GCGCTTACGG CTAAAGAAAG CTGTGTAGTA AACGTAGGGT TCGGCACGCC CAGAACATAT
AACGACGTTG TTGCCTGTTT AAATAAAGAA ACGGGCCTTA ATTTACAGCC CGATTATATT
GACAACCCGT ATCCTTTTTT CCAATTAAAA ACCGAAGCGG ATTTAACTTT GGCTAACCAG
AAAATAGGAT ATACACCTGA TTACAACCTT GAAAAAGGCA TTGAGGAATA TGTGCAGATT
TTAAATAAAA GACCTGTGCA GCCTGCGGTA AAGAAATAG
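
A GC content figure such as the 41% reported for this gene is the share of G and C bases in the sequence. The sketch below shows one way to compute it from the blocked layout used here. The function name is illustrative, and the sample input is only the first 60-base line of the gene sequence, so its result is not expected to equal the whole-gene value.

```python
# Minimal GC-content sketch. Spaces and line breaks between the
# 10-base blocks are removed before counting.

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First line of the gene sequence only (60 of the 939 bases).
first_line = (
    "ATGGATAAAA AAAGATATTT AGTAACAGGC "
    "GGCGCGGGGT TTATCGGAAG CAATATCGCT"
)
print(round(gc_percent(first_line), 1))
```

Passing the full 939-base sequence to the same function gives the whole-gene figure.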
 
Protein sequence
MDKKRYLVTG GAGFIGSNIA FELQNQGHEV TIMDDFSSGN FKNLLGFKGD VTAADVFKFM 
PEDVYFDAIF HEAAITDTTI HDQKLMMEMN VEAFKNVLHF AASNEIKRVV YASSAGTYGQ
NPCPMTETQV PMPENVYGFS KAVMDNVARE FASDHQDMVI VGLRYFNVYG PGEYYKGHTA
SMIYQLYNQM KAGKNPKIFK MGEQQRDFVY IKDVVKANLC ALTAKESCVV NVGFGTPRTY
NDVVACLNKE TGLNLQPDYI DNPYPFFQLK TEADLTLANQ KIGYTPDYNL EKGIEEYVQI
LNKRPVQPAV KK
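
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the Python below builds a codon table from the standard NCBI ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons it permits) and translates two in-frame fragments copied from the gene sequence: its first 30 bases and its last line. The last line is in frame because the 900 bases before it are a multiple of three. The helper name `translate` is illustrative, not part of any library.

```python
# Codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)

first_30 = "ATGGATAAAA AAAGATATTT AGTAACAGGC"
last_line = "TTAAATAAAA GACCTGTGCA GCCTGCGGTA AAGAAATAG"

print(translate(first_30))   # MDKKRYLVTG
print(translate(last_line))  # LNKRPVQPAVKK
```

Both fragments reproduce the matching ends of the protein sequence listed above: it begins with MDKKRYLVTG and ends with LNKRPVQPAV KK.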