Gene Francci3_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3207 
SymbolaroB 
ID3906173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3799777 
End bp3800865 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content72% 
IMG OID637880531 
Product3-dehydroquinate synthase 
Protein accessionYP_482293 
Protein GI86741893 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0774582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTGA AGGCCTCCGA CATCGTGCGG ATCCCGGTAC GGCCGGGCGG CGGGCGTCCC 
TACGACGTCG TGCTCGGGGT GGGCCTGCTC GGCGAGCTCG CCGAGACCGT CGTCGGGCGG
ACCCGCGCCG CGGTGATCCA TCCCGGGGCG TTGCGGGCCA CCGCGGACGC CGTCGTCGCC
GACCTGCGGG AGAACGCGGG CGTGGAGGCG CACGCCATCG AGGTGCCCGA CGGCGAGGAG
GCCAAGCAGC TGCGGATCGC CGGCTTCTGC TGGGACGTGC TCGGCCGGAT CGGCTTCACC
CGGGACGACA TGGTGATCGG CCTCGGCGGC GGCACCGTGA CAGACCTGGC CGGGTTCGTC
GCCGCGAGCT GGCTGCGCGG GGTGGACGTC GTCCAGGTGC CGACCACCGT GCTCGGCATG
GTCGACGCGG CGGTCGGCGG GAAGACCGGC ATCGACATCG ACGCGGGCAA GAACCTGGTC
GGGGCCTTCC ACCAGCCGCT CGGCGTGCTG TGCGATCTGG CGGCGCTGGA GTCTCTGCCG
GCCGTCGAGG TACGCGCCGG GCTCGCCGAG GTCGTCAAGA CCGGTTTCAT CGCCGACGCG
GCGATCCTCG ATCTGCTCGA CGCCGATCCG ACCGGCGCCG CGCATCTGCC CGAGCTGATC
GAGCGGTCCA TCCGGGTCAA GGCCGAGGTG GTCTCCGGCG ATCCGCGGGA GGACGGCCGG
CGGGAGATCC TCAACTATGG TCACACCCTC GGTCATGCCA TCGAGAAGGT CGAGCACTTC
AGCTGGCGGC ACGGCGCGGC GATCTCGGTG GGCATGGTGT TCGCGGCGGA GCTGTCCCGG
CTCGTCGTCG GCCTCGACGA CGCCACCGCC GACCGGCACC GTGAGCTGCT GACCCGCATC
GGCCTGCCGG TGACCTACCG GGACGACCGG TGGGCGGCGC TGCTCGACGC GATGCGGGTC
GACAAGAAGG CGCGGGGACG GCGGATGCGT TTCGTCGGCC TCGAGGCGCA GGGCCGCACC
GTCATCCTGG ACAATCCGGA CGCTGGCCTG CTGATCGCGG CGTTCACCAC TGTCGCCGAG
GGTCGGTAG
 
Protein sequence
MQVKASDIVR IPVRPGGGRP YDVVLGVGLL GELAETVVGR TRAAVIHPGA LRATADAVVA 
DLRENAGVEA HAIEVPDGEE AKQLRIAGFC WDVLGRIGFT RDDMVIGLGG GTVTDLAGFV
AASWLRGVDV VQVPTTVLGM VDAAVGGKTG IDIDAGKNLV GAFHQPLGVL CDLAALESLP
AVEVRAGLAE VVKTGFIADA AILDLLDADP TGAAHLPELI ERSIRVKAEV VSGDPREDGR
REILNYGHTL GHAIEKVEHF SWRHGAAISV GMVFAAELSR LVVGLDDATA DRHRELLTRI
GLPVTYRDDR WAALLDAMRV DKKARGRRMR FVGLEAQGRT VILDNPDAGL LIAAFTTVAE
GR