Gene Franean1_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1664 
Symbol 
ID5670066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1988742 
End bp1989836 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content74% 
IMG OID641240582 
Product3-dehydroquinate synthase 
Protein accessionYP_001506008 
Protein GI158313500 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.825748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.49413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCAC CGTCCGCGAC ACTCGGGCAC GACGACCGTG CCGGCGGGCC CGGGACACAC 
CGGGTATGGG TCGAGCTCGG CGACCGCGGT TATCCCGTCG ACATCGGGCC CGGCGTCCGG
CGCACGCTGC CGGAGGTCGT CGCCCGGATC GACGCGTCCC GGGTCGTGAT CGTCTCGGCC
CGGCCAGAGG ACGACGCGGT ACCCGATCCC GGGGTCCCGG CGCTGCGCGT CGCCGCCCGC
GACGGCGAGC CGGACAAGAA CCTGGCCAAC GTCCAGGCGC TCTGCGAGCG GTTCGCGGCG
TTCGGCCTGA CTCGCACGGA CGCGGTGGTC TCCTGCGGCG GCGGGACGAC GACCGACGTC
GTCGGGCTCG CGGCGGCGCT CTACCACCGC GGTATCGCGG TTATCCACCT CCCGACGTCG
CTGCTGGCGC AGGTCGACGC GAGCGTCGGC GGCAAGACGG CGGTGAACCT GCCGGCCGGC
AAGAACCTGG TCGGCGCCTA CTGGCAGCCC AGCGCCGTGC TGTGCGACAC CGACTACCTG
GCGACGCTGC CCGCGCGCGA GTGGACCAAC GGCTACGGGG AGATCGCGCG GGCGCACTTC
ATCGGCGCCG GCGACCTGCG CGGACGTCCC GTCGAGGAGC AGATCGCGCG CAGCGTGGCG
CTCAAGGCCT CGGTCGTGGC CCGCGACGAG CGCGACTCCG GCCTGCGCCA CATCCTCAAC
TACGGGCACA CCCTGGGCCA CGCGCTGGAG ATCGTCACCG GCTTCGAGAT GCGGCACGGC
GAGGGCGTGG CGGTCGGGAC GGTCTTCGCC GGCCGGCTCG CTGCCGCGCT GGGCCTGATC
GACGACCGGC GCGCCGACGA GCACTTCGAG GTGGTCTCGC ACTACGGACT GCCGACGGCG
CTGCCCGCCG GGGCCGACAC GGCGGCGCTC GTCGCCGCCA TGCGGCTGGA CAAGAAGTCC
ACGAACACCG GCAAGTCCAC GAACACCGGC CTCACCTTCG TCCTCGACTC GGCGGACGGC
CCCCGGCTGG TTCCGGACGT CCCGGCCGAC CTGGTGGTCG CGACGCTGGG CGCGATGGAA
CGCGAGGCGG CATGA
 
Protein sequence
MTAPSATLGH DDRAGGPGTH RVWVELGDRG YPVDIGPGVR RTLPEVVARI DASRVVIVSA 
RPEDDAVPDP GVPALRVAAR DGEPDKNLAN VQALCERFAA FGLTRTDAVV SCGGGTTTDV
VGLAAALYHR GIAVIHLPTS LLAQVDASVG GKTAVNLPAG KNLVGAYWQP SAVLCDTDYL
ATLPAREWTN GYGEIARAHF IGAGDLRGRP VEEQIARSVA LKASVVARDE RDSGLRHILN
YGHTLGHALE IVTGFEMRHG EGVAVGTVFA GRLAAALGLI DDRRADEHFE VVSHYGLPTA
LPAGADTAAL VAAMRLDKKS TNTGKSTNTG LTFVLDSADG PRLVPDVPAD LVVATLGAME
REAA