Gene Franean1_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1090 
Symbol 
ID5669504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1301369 
End bp1303165 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content72% 
IMG OID641240022 
Productacetolactate synthase 1 catalytic subunit 
Protein accessionYP_001505452 
Protein GI158312944 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGAG AGACCAGACA GATCACCGGA GCGCAGTCGC TCGTCCACTC GCTCGAGGCG 
GTCGGTGCCG ACGTGGTCTT CGGCATTCCG GGCGGGGCGA TCCTGCCCGC CTACGATCCG
CTGTTCGACT CCACCCGGGT CCGGCACGTG CTCGTCCGGC ACGAGCAGGG CGCCGGGCAC
GCCGCCGAGG GGTATGCCCA GGCCACCGGG CGTGTCGGGG TCTGCATGGC GACGTCCGGG
CCGGGCGCGA CGAACCTCGT GACACCGATC GCGGACGCCT ACATGGACTC GGTGCCCATC
GTCGCCATCA CCGGGCAGGT GCCGAGCGCG GCGATCGGCA CCGACGCCTT CCAGGAAGCC
GACATCTGCG GCATCACGCT GCCGATCACC AAGCACAACT TCCTGGTGCA GTCGGCGGAC
GACATCCCAC GCACGATCGC CGAGGCGTTC CACATCGCCT CGACGGGGCG CCCGGGCCCG
GTGCTGGTCG ACCTGCCGAA GGACATCCTG CAGGCGGTCA CGTCGGTCCT GCCGCACGAG
GTGTGGCCGC CGACGCTCGA CCTGCCCGGC TACCGCCCGG TCACCCGGCC GCACGCCAAG
CAGGTCCGCG AGGCGGCCAA GATGATCAGT GCGGCGCGCC GGCCGGTGCT CTACGTGGGC
GGTGGGGTGC TGAAGGCCCG TGCCGCCGCC GAGCTGCGGG TGCTCGCCGA GCTCACCGGC
ATCCCGGTGG TCACCACGCT GATGGCGCGC GGCGCGTTCC CCGACTCCCA CCCGCAGCAC
CTGGGCATGC CCGGGATGCA CGGCTCCGTC GCCGCGGTCA CCGCGATGCA GAAGGCCGAC
CTGCTGATCA CCCTGGGGGC CAGGTTCGAC GACCGGGTCA CCGGCAAGCT GTCGTCATTC
GCGCCCGGCG CGGCCGTCAT CCACGCCGAC ATCGACCCGG CCGAGATCGG CAAGAACCGG
ATCGCCGACG TGCCGATCGT GGGCGACTGC CGGGAGGTCA TCGTCGACCT GACCGCGGCA
CTGCGCACCG AGGAGCGCCC CGACCTCGAG GGCTGGTGGC GGTCGCTGGA CAGGTGGCGC
GCGACCTACC CGCTCGGCTA CGACCAGCCC GCCGACGGCT CGCTCGCCCC GCAGCAGGTC
ATCGAGCGGA TCGGCCGGAT CGCCGGCCCG GAGACCGTTT TCGCGGCCGG GGTGGGGCAG
CACCAGATGT GGGCCGCCCA GTTCATCTCC TACGAGCACC CCTACACCTG GCTGAACTCC
GGCGGCGCCG GCACCATGGG CTACGCCGTG CCGGCCGCGA TGGGCGCCAA GGTCGGCCGC
CCGGAGGCCA CCGTGTGGGC GATCGACGGC GACGGCTGCT TCCAGATGAC CAACCAGGAG
CTCGCCACCT GCGCCCTGGA GGGCATCCCG ATCAAGGTCG CGGTCATCAA CAACGGTTCG
CTGGGCATGG TCCGGCAGTG GCAGACGCTC TTCTACGACA AGCGCTACTC GAACACCGAC
CTGGGCACCC ACCCGGTGTC GGCGCGCACC GGGGTGACCC GGGTCCCCGA CTTCGTGCGC
CTCGCCGAGG CCCTGGGCTG CGTCGGCCTG CGCTGTGAGT CGCCCGCGGA CGTGGACGCC
ACGATCGAGA AGGCGATGAG CATCGACGAC GCCCCGGTCG TGGTCGACTT CGTGGTCCAC
CCCGACGCGA TGGTGTGGCC GATGGTGGCC GCCGGCTCGA GCAACGACGA GATCCGCGTC
GCCCGCGACA TCGCCCCCGA CTTCGACTAC TCCGGCGACG CGGAGGTCAA CCTGTGA
 
Protein sequence
MTRETRQITG AQSLVHSLEA VGADVVFGIP GGAILPAYDP LFDSTRVRHV LVRHEQGAGH 
AAEGYAQATG RVGVCMATSG PGATNLVTPI ADAYMDSVPI VAITGQVPSA AIGTDAFQEA
DICGITLPIT KHNFLVQSAD DIPRTIAEAF HIASTGRPGP VLVDLPKDIL QAVTSVLPHE
VWPPTLDLPG YRPVTRPHAK QVREAAKMIS AARRPVLYVG GGVLKARAAA ELRVLAELTG
IPVVTTLMAR GAFPDSHPQH LGMPGMHGSV AAVTAMQKAD LLITLGARFD DRVTGKLSSF
APGAAVIHAD IDPAEIGKNR IADVPIVGDC REVIVDLTAA LRTEERPDLE GWWRSLDRWR
ATYPLGYDQP ADGSLAPQQV IERIGRIAGP ETVFAAGVGQ HQMWAAQFIS YEHPYTWLNS
GGAGTMGYAV PAAMGAKVGR PEATVWAIDG DGCFQMTNQE LATCALEGIP IKVAVINNGS
LGMVRQWQTL FYDKRYSNTD LGTHPVSART GVTRVPDFVR LAEALGCVGL RCESPADVDA
TIEKAMSIDD APVVVDFVVH PDAMVWPMVA AGSSNDEIRV ARDIAPDFDY SGDAEVNL