Gene Franean1_5202 details


Gene Information

Locus tag: Franean1_5202
Symbol:
ID: 5673536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6244734
End bp: 6246488
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 72%
IMG OID: 641244056
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_001509466
Protein GI: 158316958
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
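
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 6244734
end_bp = 6246488


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1755 584"
```

Under these assumptions the result agrees with the listed gene length of 1755 bp and protein length of 584 aa.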


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.53746
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGTGT CCCACCGGAT ACCGGCGGCC AGGCCCAGGC CGCAGACCGG CCACACCGCG 
ACCTCTCCCA ACACGACCGG GCTGGACGAC GGCGGTGCCG GCGGTCAGAC CGTGCCCGGT
CAGGGCGCCA GGGGCCAGAG CATTCCCGGC CAGGGTGCCG GCGGGCCGGG TGTCCCAGGT
CAGGGTGCCG GCGGGCCGGG CGTCGGGGCC CAGGGCGTCG GTGGTCAGAA CGTCGGTGGT
CAGGGCGCCG AGGAACAGAA CGTCGCGGAC CAGGGCGCCG CGGGAGAGAG CACCGCCGGG
TGGGGGCCGG CGGAACGAGG AGGAATGACC GCAATGACCG GGACGGCGGG AACGACCGCA
CGAACAGGTC CCTCTGTGCC GACCGCGCCG TCCACGCCGA TCGCCCCGAC CGGGCCGACC
GTTCCCACGG TCCCGAACCG GCGCCTCCCC CCGGACCGCC AGGGCGCGGC CGGCCGGATG
ACCATGCCCG ACCGGGTGCC GCGCACGGAG ATCATGGGAG TTCCCGCGCC GGTGCCGCGC
GGCCGACCCG GGCCGGCCGA GCACCGGGCG CGGGTCATGA TCCTGGTCGG CACCCGTCCG
GAGATCGTGA AGCTGAGCCG AATCATCGCG GCGCTCGAGC GGGCCGTGGA CGTCTGCCTG
GTCCACTCGG GCCAGCACTA TGACTACGAG CTGAACCAGG TGTTCTTCGA CGAGCTCGGT
ATCCGCAAAC CGGACCACTT CCTCGACGCT GTCGGCGCGA GCGCCGCCGA GACCATCGGC
CGGGTGATCG CCCGCTCCGA CGCGGTGTTC GTCGACGAGT CCCCGGACGC GCTGCTGCTC
TACGGGGACA CGAACACCAC GCTCGCGGTC ATTGCGGCCC GGCGGCGGCA CATCCCGGTG
TTCCACCTGG AGGCCGGGAA CCGCTGTTTC GACGACCGGG TCCCGGAGGA GATCAACCGC
CGGCTGGTCG ACCACCTCAG CGACATCAAC CTCCCGCTCA CCGAGCACGC GCGGCGCCAC
CTGCTGGCCG AGGGCCTGCC GGCGCAGCGG ATCTTCGTCA CGGGCTCGCC GATGAAGGAG
GTCCTCGACC ACTACGCGCC GCTCGTCGAC GCCTCACCAG TGCTCACGAA CCTCGGTGTC
ACCGCGGGCC ACTTCCTGGT GGTCAGCGCG CACCGGGAGG AGAACGTCGA CGCGCCGGAG
CTGCTCATCG GCCTGCTGGA GACACTCAAC GCGCTGGCCG CCCGCTACCG GGTGCCGATC
ATCGTCTCCA CCCACCCGCG TACCCGGGAT CGTCTCGACG CCCTTGAGGC GTCCGGCCGC
GCCCCTGCGA CCGACGGCCT CGTCCGTTTC TGCCGGCCCT TCGGGTTCGC GGACTACATC
GCGTTGCAGC GGGCGGCGCA GTGCGTGATC TCGGACAGCG GCTCGCTGAC CGAGGAGGCC
TCGCTGCTCG GGTTCCCCGC GGTGATGATC CGGGAGGCGC ACGAGCGCCC TGAGGGCGTC
GACCACGGAG TGGCGGTCTC CTGCCTGCCC CGGCCGGACC GGGTCCTCGC CGCGGTCGAC
CTGGTCGTCG ACGCGGCGCA GGGGGACCGG GCACCGCGGA TCGTCCCCGA TTACGACGTG
GACGACGTCT CCCGCCGCGT CGTACGGATC ATCGTCAGCC ACATCGACTA CGTCCGCCGC
ACCGTCTGGT TCGAGCGCCC ACCAGTCGGG ACCAGCGAAC CAACGCCCGG AGGCACCTCC
CTGACCCTCC CGTAG
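
The GC content field can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch; `gc_fraction` is a hypothetical helper, and the sample input is only the first line of the gene sequence, so its value will differ from the 72% reported for the whole gene.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence only; paste the full sequence to
# compare against the GC content listed in the Gene Information fields.
first_line = "ATGGGGGTGT CCCACCGGAT ACCGGCGGCC AGGCCCAGGC CGCAGACCGG CCACACCGCG"
print(f"{gc_fraction(first_line):.0%}")
```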
 
Protein sequence
MGVSHRIPAA RPRPQTGHTA TSPNTTGLDD GGAGGQTVPG QGARGQSIPG QGAGGPGVPG 
QGAGGPGVGA QGVGGQNVGG QGAEEQNVAD QGAAGESTAG WGPAERGGMT AMTGTAGTTA
RTGPSVPTAP STPIAPTGPT VPTVPNRRLP PDRQGAAGRM TMPDRVPRTE IMGVPAPVPR
GRPGPAEHRA RVMILVGTRP EIVKLSRIIA ALERAVDVCL VHSGQHYDYE LNQVFFDELG
IRKPDHFLDA VGASAAETIG RVIARSDAVF VDESPDALLL YGDTNTTLAV IAARRRHIPV
FHLEAGNRCF DDRVPEEINR RLVDHLSDIN LPLTEHARRH LLAEGLPAQR IFVTGSPMKE
VLDHYAPLVD ASPVLTNLGV TAGHFLVVSA HREENVDAPE LLIGLLETLN ALAARYRVPI
IVSTHPRTRD RLDALEASGR APATDGLVRF CRPFGFADYI ALQRAAQCVI SDSGSLTEEA
SLLGFPAVMI REAHERPEGV DHGVAVSCLP RPDRVLAAVD LVVDAAQGDR APRIVPDYDV
DDVSRRVVRI IVSHIDYVRR TVWFERPPVG TSEPTPGGTS LTLP
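
The relationship between the two sequences above can be checked by translating the gene sequence and comparing the result with the protein sequence. The sketch below builds the codon table by hand rather than relying on any library; it assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (the two differ only in permitted start codons). `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace between sequence blocks is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
first_line = "ATGGGGGTGT CCCACCGGAT ACCGGCGGCC AGGCCCAGGC CGCAGACCGG CCACACCGCG"
last_line = "CTGACCCTCC CGTAG"
print(translate(first_line))  # prints "MGVSHRIPAARPRPQTGHTA"
print(translate(last_line))   # prints "LTLP"
```

The two outputs match the first 20 and last 4 residues of the protein sequence; the final TAG codon is the stop and contributes no residue.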