Gene Franean1_1701 details


Gene Information

Locus tag: Franean1_1701
Symbol: aroB
ID: 5670103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2033332
End bp: 2034447
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 74%
IMG OID: 641240619
Product: 3-dehydroquinate synthase
Protein accession: YP_001506045
Protein GI: 158313537
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0337] 3-dehydroquinate synthetase
TIGRFAM ID: [TIGR01357] 3-dehydroquinate synthase
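
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
print(gene_length_bp(2033332, 2034447))  # 1116
print(protein_length_aa(1116))           # 371
```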


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.16166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.109076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGCGA TGAAGGTTTC CGACATGGTC CGCATCCAGG TCACCCCGTC CGGTGACCGG 
GCCTACGACG TGGTGCTGGG CGAGGGCGCC CTCTGCGAGC TACCGGCGCT GGCCGCGGGG
CGCACCCGGG TCATGGTCAT TCACCCGCGA GCGCTGCGGG CCACCGCCGC GGCGGTCATC
GCCGAGCTGC GGGCCGGCGC CGGTGCCGGC GTCGAGACGC ACGCGTTCGA GGTGCCGGAC
GGCGAGGAGG CCAAGCAGCT GCGGGTCGCG GGCGCCTGCT GGGACGCCCT CGGCCGTGTC
GGCTTCACCC GCGACGACCT GGTGGTCGGG CTCGGCGGCG GGACGACGAC CGACCTGGCC
GGGTTCGCCG CGGCGGGCTG GCTGCGTGGT GTCGACGTCA TCCAGGTGCC CACCACCGTG
CTCGGGATGG TCGATGCGGC GGTCGGTGGC AAGACGGGCA TCGACATCGA GGCGGGCAAG
AACCTCGTCG GCGCCTTCCA CCAGCCGCTG GCGGTGCTCT GTGACCTGTC GACCCTGGCC
AGCCTGCCGG CGGTCGAGGT CCGGGCCGGG CTCGCCGAGG TCGTCAAGGC CGGGTTCATC
GCCGATCCGC GCATCCTCGA GCTGCTGGAG GCCGATCCGA CCGGGTCGGC GCGGCTGCCC
GAGCTCGTCG AGCGGTCGAT CCGGGTCAAG GCGGCGGTGG TGTCCGGCGA CCCGCGCGAG
GCCGGCCGGC GCGAGATCCT GAACTACGGG CACACCCTCG CCCACGCGAT CGAGAAGGTC
GAGAACTTCT CCTGGCGGCA CGGCGCGGCG GTCTCGGTCG GCATGGTCTT CGCCGCCGAG
CTCTCCCGGC TCGTCGCCGG GCTCGACCGC GTGACCGCCG ATCGCCACCG CGAGCTGCTG
CGGGCCATCG GGCTGCCGGT GGAGTACCGG GGGGACCGCT GGCCGGCGCT GCTCGACGCG
ATGCGGGTGG ACAAGAAGAC CCGGGGCCGG CGGTTGCGTT TCGTTGTGCT CGAAGCGCTC
GGCCGGCCGC GCGGATTCGA CGATCCCGAG CCCGGCCTGC TGCTGGCCGC GTACGGCTCG
GTCGCCGCGG GCGGTGTGAG CGCTACCGGG AACTGA
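
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, that removes the layout whitespace and computes the G+C fraction. It assumes the sequence contains only the letters A, C, G and T. Only the first line of the sequence is used here for brevity; feeding in the whole block should give a value comparable to the GC content listed in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an A/C/G/T alphabet)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, as printed.
first_line = "ATGTGCGCGA TGAAGGTTTC CGACATGGTC CGCATCCAGG TCACCCCGTC CGGTGACCGG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # G+C fraction of this line only
```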
 
Protein sequence
MCAMKVSDMV RIQVTPSGDR AYDVVLGEGA LCELPALAAG RTRVMVIHPR ALRATAAAVI 
AELRAGAGAG VETHAFEVPD GEEAKQLRVA GACWDALGRV GFTRDDLVVG LGGGTTTDLA
GFAAAGWLRG VDVIQVPTTV LGMVDAAVGG KTGIDIEAGK NLVGAFHQPL AVLCDLSTLA
SLPAVEVRAG LAEVVKAGFI ADPRILELLE ADPTGSARLP ELVERSIRVK AAVVSGDPRE
AGRREILNYG HTLAHAIEKV ENFSWRHGAA VSVGMVFAAE LSRLVAGLDR VTADRHRELL
RAIGLPVEYR GDRWPALLDA MRVDKKTRGR RLRFVVLEAL GRPRGFDDPE PGLLLAAYGS
VAAGGVSATG N
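
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration rather than the method used to build this record: it hard-codes the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a grouped DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTGCGCGA TGAAGGTTTC CGACATGGTC"))  # MCAMKVSDMV
```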