Gene Francci3_3700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3700 
Symbol 
ID3903801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4434817 
End bp4436076 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content70% 
IMG OID637881026 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_482781 
Protein GI86742381 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.865403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGCG GTACCCGGCT GGCCGGCGAG GTCGCCGTCC CGGGCGCGAA GAACTCGGTA 
TTGAAGCTGA TGGCCGCGAG CCTGCTGGCC GCGGGCCGGA CGATGCTGTC CGCCGTTCCT
GACATCCTGG ACGTGTCGGT GATGTCCAAG GTGCTGGCGG GGCTCGGTGC GACCGTCGAG
CGGGACGCCG CCGGCGGGCG GCTGAGCATT GAGGTGCCCC AGGCTCCGGG CGTCGCGGCC
GATGGTGAGC TGGTGCGGCA GATCAGGGCG TCGGTGGCGG TGCTCGGTCC GCTGGTCGCC
CGCCGTGGGG AGGCGCGGGT CGCCCTGCCC GGTGGGGACG CGATCGGATC GCGCGCGCTC
GACATTCATG TCAACGGCCT GACCAAACTC GGTGCGCTGG TTGACGTCGA GTCCGGAGTC
CTCGTCGCGC GCTGCTCCGG CAGGCTGCAG GGAGCCTCGA TCTGGCTCGA CTTCCCCAGC
GTGGGCGCCA CCGAGAACCT GTTGATGGCC GGGGTTCTCG CCAAGGGGAC GACGGTGATC
GACAATGCCG CCCAGGAACC CGAGATTGCC GACCTGTGCG CGATGCTGAC GGCGATGGGC
GGGCGTATCG ACGGCGCGGG ATCGTCGACA CTGATCATCG ACGGGGTCGA CCAGCTCACC
TCGGTGGAAC ACCGGACGGT GGCGGACCGC ATCGTCGCCG GCACGTACGC CATCGGCGCC
CTGATGACCG GTGGCGACGT GCTGATCCGG CACGGTCAGG CCGCCCACCT CAGCATCGTC
CTAGAGAAGC TGGTGCAGGC CGGTGCCGAG GTCGACGTCC GGGACGACGG ATTCCGGGTC
GTCGGACGGG GACAACCCCG GTCGATCGAC GTGGTGACGC TGCCGTACCC CGGCTTTCCG
ACTGATCTGC TGCCGCAGAT CATCGCGTTG GAGTCGGTTA GCCGGGGTAC CTCCCTGATC
ACCGAGAACA TCTTCGACAG CCGGTTCGCC TTCTGCCGCG AGCTGCACAA ACTCGGCGCC
GACCTGCGTA CGGACGGCCA CCACGTCATC GTCCGGCCCA CCGCGCGGCT GTCCGGCACC
CGGGTGACGG CCTCCGACGT GCGGGCCGGC GCGGGTCTCG TGCTCGCCGG CCTGGTTGCG
GACGGCGTCA CCGAGGTAAG CGACGTCCAC CACATCGACC GGGGATACGC CCGGTTTGTC
GAGAACATGA CGGACCTCGG CGCCGACATC CGCCGGGTCT CCGACCGCGC CGTAGCCTGA
 
Protein sequence
MTGGTRLAGE VAVPGAKNSV LKLMAASLLA AGRTMLSAVP DILDVSVMSK VLAGLGATVE 
RDAAGGRLSI EVPQAPGVAA DGELVRQIRA SVAVLGPLVA RRGEARVALP GGDAIGSRAL
DIHVNGLTKL GALVDVESGV LVARCSGRLQ GASIWLDFPS VGATENLLMA GVLAKGTTVI
DNAAQEPEIA DLCAMLTAMG GRIDGAGSST LIIDGVDQLT SVEHRTVADR IVAGTYAIGA
LMTGGDVLIR HGQAAHLSIV LEKLVQAGAE VDVRDDGFRV VGRGQPRSID VVTLPYPGFP
TDLLPQIIAL ESVSRGTSLI TENIFDSRFA FCRELHKLGA DLRTDGHHVI VRPTARLSGT
RVTASDVRAG AGLVLAGLVA DGVTEVSDVH HIDRGYARFV ENMTDLGADI RRVSDRAVA