Gene Francci3_3632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3632 
Symbol 
ID3904188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4336969 
End bp4337997 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content71% 
IMG OID637880955 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_482713 
Protein GI86742313 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.347698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTTG CGGTGATTGG CGGTGACGGG ATCGGCCCCG AGGTGGTGGC GGAGGGGTTG 
CGGGTGCTGC GCGCGGTACA TCCCAAGGTC GACACGACCG AATACGACCT GGGTGCCCGG
CGCTGGCACG AGACCGGGGA GACGCTGCCC GACTCGGTGC TGGAGGAACT GCGCGGTCAC
GACGCGATCC TGCTCGGGGC GGTCGGTGAT CCCGGCGTGC CCAGCGGTGT CCTCGAGCGT
GGCCTGCTGC TGCGGCTGCG GTTCGAGTTC GACCATCACG TCAATCTCCG GCCGGTGCGG
CTCTATCCGG GAGTCCGCTC GCCGCTGGCC GGAGATCCGG CCATCGACAT GATCGTGGTG
CGGGAGGGCA CGGAGGGGCC GTACGCGGGC GCGGGGGGTG TGCTCCGGAA GGGGACGCCC
CATGAGGTGG CGACCGAGGA GAGCCTCAAC ACCCGGTACG GCGTCGAGCG CGTCGTGCGC
GACGCGTTCC GGCGGGCCGA TCGACGCGAG CGTCGCCACC TGACTCTCGT GCACAAGAAC
AACGTGCTGA CCAAGGCCGG CGACCTGTGG TCGCGCACCG TGGCCGAGGT GGCGCCCGAG
TTCCCCGACG TGCGCGTCGA CTACCAGCAC GTGGACGCGG CCTCGATGTT CTTCGTGACC
GATCCGGGTC GGTTCGACGT CGTCGTGACG GACAACATGT TCGGTGACAT CCTCACCGAC
ATCGGCGCGG CCATCACCGG CGGGATCGGC CTGGCCGCCA GTGGCAACCT CGATCCCTCC
GGTGTCCACC CGAGCATGTT CGAGCCCGTG CACGGCAGCG CCCCGGATAT CGCCGGCAGG
CAACTCGCCG ACCCGACGGC CACCGTCGCC TCGGTGGCGA TGCTACTCGA TCATCTCGGC
CACGCCGAGG AGGCGGCGAA GGTCGAGGCC GCCGTCGCCT CCTCCCTGGC GGATCGTGCC
GCCGCGGGAG CGGCCCAGCC GTCGACCCGG GAACGTGGCG AGGACCTTGC CGCGCGGGCT
GCGGGCTGA
 
Protein sequence
MRLAVIGGDG IGPEVVAEGL RVLRAVHPKV DTTEYDLGAR RWHETGETLP DSVLEELRGH 
DAILLGAVGD PGVPSGVLER GLLLRLRFEF DHHVNLRPVR LYPGVRSPLA GDPAIDMIVV
REGTEGPYAG AGGVLRKGTP HEVATEESLN TRYGVERVVR DAFRRADRRE RRHLTLVHKN
NVLTKAGDLW SRTVAEVAPE FPDVRVDYQH VDAASMFFVT DPGRFDVVVT DNMFGDILTD
IGAAITGGIG LAASGNLDPS GVHPSMFEPV HGSAPDIAGR QLADPTATVA SVAMLLDHLG
HAEEAAKVEA AVASSLADRA AAGAAQPSTR ERGEDLAARA AG