Gene Franean1_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1030 
Symbol 
ID5669444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1209528 
End bp1210790 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content73% 
IMG OID641239959 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001505392 
Protein GI158312884 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGCG GGACCAGGCT GGTCGGCGAG GTCGCCGTTC CGGGCGCGAA GAACTCGGTG 
CTGAAGCTGA TGGCGGCGAG CCTGCTCGCG CCCGGCCGCA CCACGTTGGA CGCGGTACCC
GACATCCTTG ACGTCTCGGT GATGGCCGAC GTCCTGCGTG GCCTCGGGGC CGAGGCGGAG
CGTGACCGGG CCGGGGGTCG GATCGTCATC GACATACCCG AGGTGGTCGA TGCCACCGCG
GACGCCGAGC TCGTCCGCCG GATCCGGGCG TCGGTGGCCA TTCTGGGGCC GCTCGTCGCC
CGCCGCGGGG AGGCCCGGGT GGCGCTGCCC GGCGGCGACG CGATCGGCTC GCGCGCGCTC
GACATCCACA TGAACGGCCT GACGAAGCTC GGTGCCGTGG TGGACGTCGA GGCCGGGGTA
CTGGTGGCCC GCTGCTCCGG GCGGCTGCAG GGCGCCTCGA TCTGGCTGGA CTTCCCGAGC
GTCGGGGCCA CCGAGAACCT GCTGATGGCC GGGGTGCTCG CCAAGGGCAC GACCGTGATC
GACAACGCGG CCCAGGAGCC GGAGATCGCC GACCTGTGCG CGCTGCTCAC CGCCATGGGC
GCCCGGATCG ACGGCGCGGG CACGTCCACG CTCGTCATCG AGGGGGTGGA AGGGCTGCGG
CCGGTGCTGC ACCGCACCGT CCCGGACCGG ATCGTCGCAG GTACGTGGGC GATCGGCGCG
CTGATGACCG GTGGTGACGT GACCATCCGG CACGGCCGGG CCGAGCATCT CGGGATCGTC
CTGGAGAAGC TCGCGGGCGC GGGGGCGGCC GTCGAGCTCG TCGAGGACGG TTTCCGGGTG
AGCGCGGACG GGCAGCCGCG CTCGATCGAC GTCGTCACGC TGCCCTACCC GGGCTTCCCG
ACCGACCTCC TGCCGCAGGT GATCGCACTG GAGGCGGTCA GCGAGGGAAT CTCGCTGATC
ACCGAGAACG TCTTCGACAG CAGGTTCGTG TTCTGCCGGG AGCTGCACGC GCTCGGAGCC
GACCTGCGCA CCGACGGCCA TCACGTGGTC GTCCGGCCCA CCCCGCGGCT GACCGGCGCC
TCCGTGCTCG CCTCGGACGT CCGCGCCGGC GCGGCGCTGG TGCTGGCAGG CCTGGTGGCG
GAGGGCACGA CCGTGGTCCG TGACGTCCAC CACATCGACC GCGGGTACGC GCACTTCGTG
GAGAACCTCA CCGCGCTCGG CGCCGAGATC CGTCGCGAGC CGTCCTCGGC GGCCGCGGCC
TGA
 
Protein sequence
MTGGTRLVGE VAVPGAKNSV LKLMAASLLA PGRTTLDAVP DILDVSVMAD VLRGLGAEAE 
RDRAGGRIVI DIPEVVDATA DAELVRRIRA SVAILGPLVA RRGEARVALP GGDAIGSRAL
DIHMNGLTKL GAVVDVEAGV LVARCSGRLQ GASIWLDFPS VGATENLLMA GVLAKGTTVI
DNAAQEPEIA DLCALLTAMG ARIDGAGTST LVIEGVEGLR PVLHRTVPDR IVAGTWAIGA
LMTGGDVTIR HGRAEHLGIV LEKLAGAGAA VELVEDGFRV SADGQPRSID VVTLPYPGFP
TDLLPQVIAL EAVSEGISLI TENVFDSRFV FCRELHALGA DLRTDGHHVV VRPTPRLTGA
SVLASDVRAG AALVLAGLVA EGTTVVRDVH HIDRGYAHFV ENLTALGAEI RREPSSAAAA