Gene Franean1_4743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4743 
Symbol 
ID5673085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5664812 
End bp5665822 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content72% 
IMG OID641243600 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_001509016 
Protein GI158316508 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC CGAACATCTA TCTGCAGGAC GTCACCCTGC GGGACGGCAT GCACGCCATC 
CGGCACCGGG TCGACCCGGA GCGGGTCGGC GCCATCGTGG CCGCTCTCGA CAAGGCGGGC
GTCCGGGCGA TCGAGGTCAC CCACGGCGAC GGGCTGGCCG GCTCCAGCCT GACGTACGGC
CCCGGCAGCC ACACGAACTG GGAGTGGATC GAGGCGGCGG TCACCAACGC CTCGCAGGCG
ACGATCACCA CGCTGCTGCT ACCGGGCGTG GGCACGATCG CCGAGCTGCG CCGCGCGCAC
GCGATGGGGG TCGGCTCGGT CCGCGTCGCG ACGCACTGCA CCGAGGCGGA CGTCGCCGCC
CAGCACATCG CCGCGGCGCG CGAGCTCGGC ATGGACGTCT CCGGCTTCCT GATGATGAGC
CACATGGCCG AGCCGGCCGA GCTGGCCGCC CAGGCCAAGC TGATGGAGTC GTACGGGGCG
CACTGTGTCT ACGTCACCGA CTCGGGCGGC CGGCTGACCA CCGACCGCGT GCGCGAGCGG
GTCCGTGCCT ACCGCGACGT CCTGCGCCCG GACACCCAGA TCGGCATCCA CGCGCACGAG
AACCTCTCGC TGTCGGTCGC CAACTCGTTC GCCGCGGTCG AGGAGGGCGC CTACCGCGTC
GACGCGTCGC TCGCCGGCCA GGGAGCCGGC GCCGGCAACT GCCCGATCGA GCCGTTCGTC
GCGGTCGCGC TGCTGCTGGG CTGGGATCTC GACTGCGACC TGCTCGCGCT GGAGGACGCG
GCCGAGGACC TGGTCCGGCC GTTGCAGGAC CGTCCGGTCC GGGTCGACCG CGAGACGCTC
ACGCTCGGCT TCGCGGGCGT GTACTCCAGT TTCCTCCGGC ACGCCGAGAT CGCCGCCGAG
ACCTACGGCG TGGACGCCCG CAGCATCCTG ATCGAGGCGG GCCGGCGAAA GCTGGTCGGC
GGCCAGGAGG ACATGCTCGT CGACATCGCC CTGGCGATAC AGCCCAAGTA G
 
Protein sequence
MSGPNIYLQD VTLRDGMHAI RHRVDPERVG AIVAALDKAG VRAIEVTHGD GLAGSSLTYG 
PGSHTNWEWI EAAVTNASQA TITTLLLPGV GTIAELRRAH AMGVGSVRVA THCTEADVAA
QHIAAARELG MDVSGFLMMS HMAEPAELAA QAKLMESYGA HCVYVTDSGG RLTTDRVRER
VRAYRDVLRP DTQIGIHAHE NLSLSVANSF AAVEEGAYRV DASLAGQGAG AGNCPIEPFV
AVALLLGWDL DCDLLALEDA AEDLVRPLQD RPVRVDRETL TLGFAGVYSS FLRHAEIAAE
TYGVDARSIL IEAGRRKLVG GQEDMLVDIA LAIQPK