Gene Francci3_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0434 
Symbol 
ID3903623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp516423 
End bp517493 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content71% 
IMG OID637877766 
Product3-dehydroquinate synthase 
Protein accessionYP_479550 
Protein GI86739150 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACAG CACAGCGGAC GAGTCCGATC GAGCAGATCA CCGCCGATCA TCGACGGGTC 
TCGGTGGACC TCGGCTCACG CAGCTATCCG ATCGATATCG GTCCCGGGGT CCGCCGAATC
CTGCCCGACA TCGTCGCGCG GATCGGCGCC CGGCGGGCCG TCATCGTCTC GGCGCGGCCG
CAGGACGCGG TTCCCGACCC GGGCGTGCCG GTACTGCGGC TGGCCGCACG GGACGGCGAG
GCGGACAAGA ACCTGTCGAA CGTCGAGGCG CTCTGCGGCC GGTTCGCGGC TTTCGGGCTG
ACCCGGTCGG ACGTCGTCAT CTCCTGCGGC GGGGGAACCA CCACGGACGT CGTCGGGCTC
GCCGCGGCAT TGTACCACCG CGGCGTGGCG GTGGTACACC TCCCGACATC CCTGCTCGCC
CAGGTGGACG CGAGCGTCGG CGGGAAGACG GCGGTGAACC TGCCCGCCGG CAAGAACCTC
CTCGGTGCCT ACTGGCAACC CAGCGCCGTC CTCTGCGACA CCGACCACCT GCGGACCCTG
CCCCGGCGGG AGTGGATCAA CGGCTACGGC GAGATCGCGC GGGCCCACTT CATCGGCACC
GGGGACCTGC GCGGTCTGCC GGTGACGGAG CAGATCACCG CCAGCGTGGC GCTCAAGGCC
GCCGTCGTCG CTCGGGACGA ACGGGACTCA AGCCTGCGCC ACATCCTCAA CTACGGCCAC
ACCCTGGGGC ATGCCCTGGA GCGCGTCACC GACTTCGTGC TGCGCCACGG TGAGGCGGTG
GCGATCGGCA CCGTGTTCGC CGGCCGTCTC GCCGGGGAGC TCGGCCGGAT CGGGGACGAT
CGGGTCCGCG AGCACCTGGA CGTGGTCCGC GGCTACGGGC TGCCGACGGC CCTGCCCACC
GAGGCGGACG CCGCCGAACT CGTCGCCGTG ATGCGTCTGG ACAAGAAGTC GACGAACACC
GGCCTCACCT TCGTGCTCGA CGGTGCGGAC GGCCCACAGC TGGTGGGCGA CATTCCGGAG
GACCTGGTCA TGAAGACGCT CGGCGACATG CCGCGCGGGC CGCTGGCCTG A
 
Protein sequence
MLTAQRTSPI EQITADHRRV SVDLGSRSYP IDIGPGVRRI LPDIVARIGA RRAVIVSARP 
QDAVPDPGVP VLRLAARDGE ADKNLSNVEA LCGRFAAFGL TRSDVVISCG GGTTTDVVGL
AAALYHRGVA VVHLPTSLLA QVDASVGGKT AVNLPAGKNL LGAYWQPSAV LCDTDHLRTL
PRREWINGYG EIARAHFIGT GDLRGLPVTE QITASVALKA AVVARDERDS SLRHILNYGH
TLGHALERVT DFVLRHGEAV AIGTVFAGRL AGELGRIGDD RVREHLDVVR GYGLPTALPT
EADAAELVAV MRLDKKSTNT GLTFVLDGAD GPQLVGDIPE DLVMKTLGDM PRGPLA