Gene Franean1_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1833 
Symbol 
ID5670235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2199164 
End bp2200522 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content72% 
IMG OID641240754 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001506177 
Protein GI158313669 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0830687 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.174475 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAACG ACCTGGACAT CTGGCGGAGC CTCCCAGCCA GGCAGCAGCC CTCCTGGCCC 
GACGGGGAGG AGCTGGCGGC GGCGTTCGCC GAGCTCTCGG CCCTGCCGCC GCTGGTGACC
GCGCCGGAGG TGCGTTCGCT GACCGACCGT CTCGCGATGG TGGCCCGCGG CGAGGCGTTT
CTGCTCCAGG GCGGTGACTG CGCGGAGACC TTCGCGGCCA ACACCGCAGA CAAGATCCGC
GACAAGGTCA AGACCCTGCT GCAGATGGCG GTCGCCCTGA CCTACGGCGC CAGCACGCCG
GTGGTCAAGG TGGCCCGCAT CGCCGGCCAG TACGCCAAGC CGCGCTCGGC CGACATCGAG
GCCTCGACCG GGCTGCCCTC CTACCGGGGC GACGCGGTGA ACGACATCGC GCCGAACGCG
CAGGCCCGCC GTCCGAACCC GCGGCGGATG GTCGACGCCT ACCACCAGAG CGCCGTCGCG
CTGAACCTGG TGCGGGCGTT CGCGACCGGC GGCTTCGCCG ACCTGTCCAA GGTCCACGAG
TGGAACAAGG CGTTCGTGCG CGACTCGGCC GCCGGCCGCC GCTACGAGCT GATGGCCGTC
GACATCGAGC GCGCCCTCGC GTTCATGGCC GCCTGCGGCA TCGACCTCGA CCGGACGGCC
GCGCTGACCG GCGTCGAGAT GTTCACCAGC CACGAGGGCC TGCTGATGGA GTACGAGCGG
GCGCTCACCC GCACCGAGGA GTCGACCGGC GAGGTCTACG ACCTGTCGGC GCACATGATC
TGGATCGGCG AGCGCACCCG TGACCTCGAC GGCGCCCACG TCGACTTCCT CTCCCGGGTC
GGCAACCCGA TCGGCTGCAA GATCGGCCCG ACGGCAACGC CGGACGAGGT CGTGGCGCTC
ACCGAGCGGC TCAACCCCGA CCACATCCCC GGCCGGCTGA CGCTGATCGC GCGGATGGGC
GCCAAGCGGG TGCGCGACGC CCTCCCGCCG ATCATCGACA AGGTGAACGC GGCCGGGCAC
CCCGTGGTGT GGTCGTGCGA CCCGATGCAC GGCAACACCC GCGACGTCGG CGGCGTGAAG
ACCCGGCACT TCGATGACGT CCTCGACGAG GTCTTCGGGT TCTTCGAGGT GCACAAGGGG
CTCGGGACGC ACCCCGGTGG CCTGCACATC GAGCTGACCG GCGAGAACGT CACCGAGTGC
CTCGGCGGCG CGGAGATGAT CGGCGAGGCC GACCTCGGTG GCCGCTACGA GACGGCCTGC
GATCCGCGGC TGAACACCGG CCAGGCGCTC GAGCTGGCCT TCCTGGTGGC CGAGTCGCTG
CAGCAGGCCC GCGCGGAGCG CGACACACAC ACCCGCTGA
 
Protein sequence
MSNDLDIWRS LPARQQPSWP DGEELAAAFA ELSALPPLVT APEVRSLTDR LAMVARGEAF 
LLQGGDCAET FAANTADKIR DKVKTLLQMA VALTYGASTP VVKVARIAGQ YAKPRSADIE
ASTGLPSYRG DAVNDIAPNA QARRPNPRRM VDAYHQSAVA LNLVRAFATG GFADLSKVHE
WNKAFVRDSA AGRRYELMAV DIERALAFMA ACGIDLDRTA ALTGVEMFTS HEGLLMEYER
ALTRTEESTG EVYDLSAHMI WIGERTRDLD GAHVDFLSRV GNPIGCKIGP TATPDEVVAL
TERLNPDHIP GRLTLIARMG AKRVRDALPP IIDKVNAAGH PVVWSCDPMH GNTRDVGGVK
TRHFDDVLDE VFGFFEVHKG LGTHPGGLHI ELTGENVTEC LGGAEMIGEA DLGGRYETAC
DPRLNTGQAL ELAFLVAESL QQARAERDTH TR