Gene Franean1_1113 details

Gene Information

Locus tag: Franean1_1113
Symbol:
ID: 5669526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1330711
End bp: 1332108
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 73%
IMG OID: 641240045
Product: isopropylmalate isomerase large subunit
Protein accession: YP_001505473
Protein GI: 158312965
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00170] 3-isopropylmalate dehydratase, large subunit
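
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the source database.

```python
# Sketch: consistency check for the coordinate and length fields of a CDS record.
# Assumptions (not stated in the record): coordinates are 1-based inclusive,
# and the reported gene length includes the terminating stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values copied from the record above.
length_bp = gene_length(1330711, 1332108)
print(length_bp)                  # 1398
print(protein_length(length_bp))  # 465
```

Under these assumptions the computed values agree with the reported Gene Length (1398 bp) and Protein Length (465 aa).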


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.681996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.828695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCGCA CACTCGCGGA GAAGGTCTGG GACGCACACG TCGTCCGCCG CGCGGACGGA 
GAGCCCGACC TGCTCTACAT CGATCTGCAC CTCGTCCACG AGGTCACCTC GCCGCAGGCG
TTCGAGGCGC TGCGGCTGGC CGGGCGCCCG GTCCGCCGTC CCGACCTGAC GCTGGCGACC
GAGGACCACA ACGTCCCGAC GACCGACACG CTGCTGCCGA TCGCCGACCC GGTCTCGCGG
GCGCAGGTCG AGGCGCTGCG CAAGAACTGC GCCGACTTCG GGGTCCGCCT GTTCCCGATG
AACGACCCGG ACCAGGGCAT CGTCCACGTG GTCGGCCCGC AGCTCGGCCT GTCCGAGCCG
GGCATGACGA TCGTCTGTGG CGACAGCCAC ACCTCGACGC ACGGCGCCTT CGGGGCGCTC
GCCTTCGGCA TCGGCACCAG CCAGGTCGAG CATGTGCTGG CCACCCAGAC GCTGCCGCAG
CGCCGCCCGA AGACGATGGC GGTCACCGTC CAGGGTGAGC TGCCCGCCGG GGTCACCGCG
AAGGACCTGA TCCTCGCCGT GATCGCCCGG ATCGGCACGG GTGGCGGCGC CGGCTACGTC
ATCGAGTACC GCGGCGAAGC CGTCCGCGGG CTGTCGATGG AGGGCCGGAT GACGGTCTGC
AACATGTCGA TCGAGGCGGG CGCGCGCGCC GGGATGATCG CCCCGGACGA GACCACGTTC
GAGTACCTCA GGGGACGTCC GAACGCCCCG ACCGGGGCCG ACTGGGACGC CGCGGTCGAG
TACTGGCGCA CCCTGGCCAC CGACCCCGAC GCCACGTTCG ACCACGAGGT CGTCATCGAC
GGGCCCAGCC TGAGCCCGTA CGTCACCTGG GGGACCAACC CGGGCCAGGC TGCGCCGCTG
AGCTCGCCCG TGCCCGACCC GGCCGCCTAT GCCGACCCGG CCGCGCGCGG CTCGGTGGAA
CGCGCCCTGG CCTACATGGA TCTCGTGCCG GGCACCCCGC TGTCCGACGT CGCCGTCGAC
ACCGTCTTCA TCGGATCCTG CACCAACGGC CGGATCTCCG ACCTGCGTGA CGCCGCCGAC
GTGCTGCGCG GGCGCCAGGT GGCGGACGGC CTTCGGGTCC TGGTCGTCCC CGGCTCGATG
GCGGTCAAGG CCGAGGCGGA GGCGGAGGGC CTCGACGAGG TGTTCCGCGC CGCGGGCGCC
GACTGGCGTA GCGCCGGCTG CTCGATGTGC CTGGGCATGA ACCCGGACAC CCTGCGGCCG
GGGGAGCGCA GCGCGTCGAC GTCCAACCGC AACTTCGAGG GCCGGCAGGG CCCCGGCGGG
CGAACCCACC TCGTCTCGCC GGCGGTCGCC GCCGCCACCG CGGTGACCGG CCGCCTCACG
GCGCCCGCCG ACCTGTAG
 
Protein sequence
MGRTLAEKVW DAHVVRRADG EPDLLYIDLH LVHEVTSPQA FEALRLAGRP VRRPDLTLAT 
EDHNVPTTDT LLPIADPVSR AQVEALRKNC ADFGVRLFPM NDPDQGIVHV VGPQLGLSEP
GMTIVCGDSH TSTHGAFGAL AFGIGTSQVE HVLATQTLPQ RRPKTMAVTV QGELPAGVTA
KDLILAVIAR IGTGGGAGYV IEYRGEAVRG LSMEGRMTVC NMSIEAGARA GMIAPDETTF
EYLRGRPNAP TGADWDAAVE YWRTLATDPD ATFDHEVVID GPSLSPYVTW GTNPGQAAPL
SSPVPDPAAY ADPAARGSVE RALAYMDLVP GTPLSDVAVD TVFIGSCTNG RISDLRDAAD
VLRGRQVADG LRVLVVPGSM AVKAEAEAEG LDEVFRAAGA DWRSAGCSMC LGMNPDTLRP
GERSASTSNR NFEGRQGPGG RTHLVSPAVA AATAVTGRLT APADL
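
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so a standard codon table is used here.

```python
# Sketch: strip the 10-base block formatting, compute GC fraction, and
# translate a CDS. Helper names are illustrative, not from the source.

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; the same
# codon-to-amino-acid assignments apply to translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGTCGCA CACTCGCGGA GAAGGTCTGG"))  # MGRTLAEKVW
```

Passing the full gene sequence to `translate` and comparing the result against the protein sequence (after `clean`) is one way to confirm that the two blocks correspond; `gc_content` on the full gene sequence can be compared with the reported GC content.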