Gene Francci3_3084 details


Gene Information

Locus tag: Francci3_3084
Symbol:
ID: 3904210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3654389
End bp: 3655789
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 70%
IMG OID: 637880405
Product: 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase
Protein accession: YP_482170
Protein GI: 86741770
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II
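
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp 3654389, End bp 3655789.
length = gene_length(3654389, 3655789)
print(length, "bp")
print(protein_length(length), "aa")
```

Under these assumptions the results agree with the listed Gene Length (1401 bp) and Protein Length (466 aa).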


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACCG CTCGAGCTCG ACCTGGGCCG TACCCTGGTG TGGTGAGCGT ACTGGATCTC 
TGGCGGACCC TTCCGGCCAA GCAGCAGCCG TCCTGGCCCG ACCCGGCAGA GTTGAGAGCC
GCCCATGACC AGCTTGCGGC CCTCCCCCCG CTGGTCACCG CGCCCGAGAT CCGGTCCCTG
ACCGACCGTC TCGCGCTGGT CGCTCGGGGC GAGGCGTTCC TGTTGCAGGG CGGTGACTGC
GCCGAGACCT TCGTGGCGAA CACCGCCGAC AAGATTCGGG ACAAGGTCAA GACGCTGCTG
CAGATGGCGG TCGTCCTCAC CTATGGCGCG AGCACGCCGG TGGTGAAGGT CGCCCGCATC
GCCGGGCAGT ACGCCAAGCC GCGGTCCGCG GACATCGAGG CCAGCACCGG GCTGCCCTCC
TACCGGGGCG ACGCCGTCAA CGACATCGCG CCCACGGCCG CCGCGCGGCA GCCCGATCCG
ATGCGGATGG TCGCCGCCTA CCACCAGAGC GCAGTCGCGC TGAACCTCGT GCGGGCCTTC
GCCACCGGAG GTTTCGCCGA CCTGTCGAAG GTCCACGAGT GGAACAAGGC GTTCGTGCGG
GATTCCGCGG CGGGTCGCCG CTACGAGGTG ATGGCGACGG ACATCGAGCG TGCCCTCGCC
TTCATGGCCG CCTGCGGCAT CGACCTCGAT CGGACCGCCG CGCTGACCGG GGTCGAGCTG
TTCACGAGTC ACGAGGGCCT CCTGCTGGAG TACGAGCGTG CCCTGACCCG GATCGAGGAC
GCCACCGGCG ACCCGTACGA CCTGTCCGCC CACATGATCT GGATCGGCGA GCGGACCCGG
GACCTCGACG GTGCCCACGT CGACCTGCTG TCCCGGGTGG GCAATCCGAT CGGCTGCAAG
ATCGGGCCGA AGGCTGGCCC GGACGAGGTC CTCGAGCTTG CCGAGCGGCT CAACCCCGAC
CATGTCCCCG GCCGGCTCAC CCTGATCTCC CGGATGGGGG CAAAGCGGGT CCGGGACACG
CTGCCCCCGA TCATCGAGAA GGTCAACGCC GCCGGGCCGC CAGTGGTGTG GTCGTGCGAT
CCGATGCACG GCAACACCCG TGACGTGGGC GGCGTCAAGA CGCGCCACTT CGACGATGTC
CTCGACGAGG TCTTCGGCTT CTTCGAGGTC CACAAGGCGT TGGGCACCCA CCCCGGTGGC
CTGCACATCG AGCTGACCGG GGAGAACGTC ACCGAGTGCC TCGGTGGGGC CGAACTCATC
GGGGAAGACG ACCTCGGCGG CCGTTACGAG ACCGCCTGCG ACCCCCGTCT GAACACCGGG
CAGGCGCTAG AACTGGCCTT CCTCGTCGCC GAGATGCTGC AGAACACGCG GGGAGAACGC
GGCGCGCCGT GGCCCTCGTG A
 
Protein sequence
MVTARARPGP YPGVVSVLDL WRTLPAKQQP SWPDPAELRA AHDQLAALPP LVTAPEIRSL 
TDRLALVARG EAFLLQGGDC AETFVANTAD KIRDKVKTLL QMAVVLTYGA STPVVKVARI
AGQYAKPRSA DIEASTGLPS YRGDAVNDIA PTAAARQPDP MRMVAAYHQS AVALNLVRAF
ATGGFADLSK VHEWNKAFVR DSAAGRRYEV MATDIERALA FMAACGIDLD RTAALTGVEL
FTSHEGLLLE YERALTRIED ATGDPYDLSA HMIWIGERTR DLDGAHVDLL SRVGNPIGCK
IGPKAGPDEV LELAERLNPD HVPGRLTLIS RMGAKRVRDT LPPIIEKVNA AGPPVVWSCD
PMHGNTRDVG GVKTRHFDDV LDEVFGFFEV HKALGTHPGG LHIELTGENV TECLGGAELI
GEDDLGGRYE TACDPRLNTG QALELAFLVA EMLQNTRGER GAPWPS
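
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch, the Python below translates the first and last blocks of the gene sequence and compares them with the ends of the protein sequence. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The `translate` helper is hypothetical, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence above.
first_block = "ATGGTGACCG CTCGAGCTCG ACCTGGGCCG"
last_block = "GGCGCGCCGT GGCCCTCGTG A"

print(translate(first_block))  # MVTARARPGP
print(translate(last_block))   # GAPWPS
```

Both fragments match the start (MVTARARPGP) and end (GAPWPS) of the listed protein sequence; the final TGA codon is the stop and is not translated.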