Gene Franean1_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1098 
Symbol 
ID5669512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1311667 
End bp1312695 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content74% 
IMG OID641240030 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001505460 
Protein GI158312952 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.51179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTTG CGGTCATTGG TGGCGACGGA ATCGGCCCGG AAGTGGTCGC GGAGGGGCTG 
CGCGTGCTAC GCGCCGTGCA TCCCAAAGTG GAGACCACCG ACTACGACCT GGGCGCGCGG
CGCTGGCACG AGACCGGCGA GACCCTGCCC GACAGCGTCC TGGCGGAGCT GCGCGGGCAC
GACGCGATCC TGCTCGGCGC TGTCGGCGAC CCCGGGGTAC CCAGCGGCGT CCTGGAACGC
GGGCTGCTGC TGCGGCTGCG GTTCGAGCTG GACCACCACG TCAACCTCCG GCCGGTCCGG
CTGTACCCCG GAGTGACCTC ACCGCTCGCC GGTGACCCCG CCATCGACAT GATCGTGGTG
CGGGAGGGGA CGGAGGGCCC CTACGCCGGC GCCGGCGGCA CTCTGCGGCG CGGGACGCCG
CAGGAGGTGG CGACCGAGGA GAGCCTGAAC ACGCGCTTCG GGGTCGAGCG GGTCGTGCGC
GACGCCTTCG CGCGGGCGAG CCGGCGTCCC CGCGCCCACC TCACCCTGGT ACACAAGACC
AACGTGCTCA CCAAGGCGGG CGACCTGTGG GCCCGCACGG TCGCCGAGGT CGGCGCCGAG
TTCCCCGCCG TCAGCGTCGA CTACCAGCAC GTGGACGCCG CGTCGATGTT CTTCGTGACC
GACCCGGCCC GGTTCGACGT GGTGGTCACC GACAACATGT TCGGCGACAT CCTGACCGAC
ATCGGCGCCG CGATCACCGG CGGCATCGGG CTCGCCGCCA GCGGGAACCT CGACCCCTCG
GGCGCGAACC CGAGCATGTT CGAGCCGGTC CACGGCAGCG CTCCCGACAT CGCCGGCCAG
GGGCTGGCGG ACCCGACCGC GACGGTCGCC TCGGTCGCGA TGCTGCTGGA CCACCTCGGC
CACGCCGACG AGGCGGCCCG GGTGGAGGGC GCGGTGGCCG CGTCGCTGGC CGCCCGCGCC
GCGGCCGGCG GTGCCCGCCG CTCCACCCGC GAGATCGGCG ACGACCTGGC CACCCGCGCC
GCGGGCTGA
 
Protein sequence
MRLAVIGGDG IGPEVVAEGL RVLRAVHPKV ETTDYDLGAR RWHETGETLP DSVLAELRGH 
DAILLGAVGD PGVPSGVLER GLLLRLRFEL DHHVNLRPVR LYPGVTSPLA GDPAIDMIVV
REGTEGPYAG AGGTLRRGTP QEVATEESLN TRFGVERVVR DAFARASRRP RAHLTLVHKT
NVLTKAGDLW ARTVAEVGAE FPAVSVDYQH VDAASMFFVT DPARFDVVVT DNMFGDILTD
IGAAITGGIG LAASGNLDPS GANPSMFEPV HGSAPDIAGQ GLADPTATVA SVAMLLDHLG
HADEAARVEG AVAASLAARA AAGGARRSTR EIGDDLATRA AG