Gene Francci3_3640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3640 
Symbol 
ID3905321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4346845 
End bp4348671 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content70% 
IMG OID637880963 
Productacetolactate synthase 1 catalytic subunit 
Protein accessionYP_482721 
Protein GI86742321 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGAG AGATCACCGG AGCCCAGTCG CTCGTCCACT CCCTTGAAGC GGTCGGAGCC 
GATGTGGTCT TCGGCATCCC CGGCGGCGCG ATCCTGCCCG CATATGATCC ACTGTTCGAC
TCCACCCGGG TGCGACATGT CCTTGTCCGG CACGAGCAGG GCGCGGGTCA TGCGGCCGAG
GGGTACGCGC AGGCCACCGG GCGGGTCGGG GTGTGCATGG CGACGTCGGG GCCGGGGGCG
ACCAACCTGG TGACTCCGAT CGCCGACGCC TACATGGACT CGGTGCCGAT CGTGGCCGTC
ACCGGGCAGG TCCCCAGTCC CTCGATCGGG ACCGACGCCT TCCAGGAGGC GGACATCTGC
GGCATCACCC TGCCGATCAC CAAGCACAAC TTCCTGGTCC AGTCCGCCGA CGACATCCCG
CGGATCATCG CGGAGGCGTT CCACCTGGCG GCGACGGGCC GGCCCGGACC GGTGCTCGTC
GACCTGCCCA AGGACATCCT GCAGTCGGCC ACCTCCGTGC TGCCGCACGA TGTCTGGCCG
CCGGCTCTCG ACCTGCCCGG TTATCGTCCG GTCACCCGCC CGCACGGCAA GCAGGTGCGG
GAGGCGGCGA AGATGATCAG TGCTGCCCGG CGTCCGGTGC TCTATATCGG TGGGGGAGTG
CTCAGGGCCC GCGCTGCCGC GGAGCTGCGG ACACTGGCGG AGCTCACCGG CATCCCGGTG
GTCACCACCC TCATGGCCCG CGGCGCCTTC CCCGACTCCC ATCCCCAGCA CCTGGGCATG
CCCGGCATGC ACGGGTCGGT CGCCGCGGTG ACCGCGTTGC AGAAGGCGGA TCTGCTGATC
ACTCTCGGGG CCCGCTTCGA TGACCGGGTG ACCGGCAGGC TGTCCTCCTT CGCGCCCGGT
GCCGCGGTCA TTCACGCCGA CATCGACCCG GCCGAGATCG GCAAGAACCG GACCGCGGAC
GTACCGATCG TCGGTGACTG CCGTGATGTG ATCAACGAGC TCGTCGCCGC GCTCGTCCTG
GAGGAGCGGC CGGACCTCGC CGCCTGGTGG CGGACGCTGG ACGGCTGGCG CCGGACCTAC
CCGCTGGGCT ACGACCAGCC GGCTGACGGC TCGCTGGCCC CGCAGTACGT CATCGAGCGG
CTCGGGAAGA TCGCGGGACC GGAGACCATC TTCGCCGCCG GCGTCGGGCA GCACCAGATG
TGGGCCGCGC AGTTCATCTC CTACGAGAAC CCCTACACCT GGCTGAACTC CGGCGGCGCC
GGCACGATGG GCTACGCCGT GCCGGCCGCG ATGGGCGCCA GGGTCGGCCG CCCCGACGCC
ACGGTGTGGG CCGTCGACGG TGACGGCTGC TTCCAGATGA CCAACCAGGA GCTTGCGACC
TGCGCGCTGG AGGGAATCCC GATCAAGGTT GCCGTCATCA ACAACGGGTC GCTCGGCATG
GTCCGCCAGT GGCAGACCCT CTTCTACGAC AAGCGGTACT CCAACACGGA GCTCGGGACG
CACCCGGACT CCCCGCGGAC CGGCGGCGCA GGGGCGTCTG GGCGGGGCTC AGGGGCGTCC
AGGAGGGTGC GGGTGCCCGA CTTCGTCCGG CTGGCCGAGG CGTTGGGCTG CGTCGGGCTG
CGCTGCGAGA CGGCGGCGGA TGTCGACGCG ACGATCGAGA AGGCGATGGC GATCGACGAC
GCCCCGGTCG TGGTGGACTT CGTCGTGCAT CCCGACGCCA TGGTGTGGCC GATGGTCGCC
GCCGGCGCCA GCAACGACGA CATCCGCATC GCGCGCGACA CCGCGCCGGA CTTCGACTAC
TCCGGCGACG CCGAGGTGAA CATCTGA
 
Protein sequence
MTREITGAQS LVHSLEAVGA DVVFGIPGGA ILPAYDPLFD STRVRHVLVR HEQGAGHAAE 
GYAQATGRVG VCMATSGPGA TNLVTPIADA YMDSVPIVAV TGQVPSPSIG TDAFQEADIC
GITLPITKHN FLVQSADDIP RIIAEAFHLA ATGRPGPVLV DLPKDILQSA TSVLPHDVWP
PALDLPGYRP VTRPHGKQVR EAAKMISAAR RPVLYIGGGV LRARAAAELR TLAELTGIPV
VTTLMARGAF PDSHPQHLGM PGMHGSVAAV TALQKADLLI TLGARFDDRV TGRLSSFAPG
AAVIHADIDP AEIGKNRTAD VPIVGDCRDV INELVAALVL EERPDLAAWW RTLDGWRRTY
PLGYDQPADG SLAPQYVIER LGKIAGPETI FAAGVGQHQM WAAQFISYEN PYTWLNSGGA
GTMGYAVPAA MGARVGRPDA TVWAVDGDGC FQMTNQELAT CALEGIPIKV AVINNGSLGM
VRQWQTLFYD KRYSNTELGT HPDSPRTGGA GASGRGSGAS RRVRVPDFVR LAEALGCVGL
RCETAADVDA TIEKAMAIDD APVVVDFVVH PDAMVWPMVA AGASNDDIRI ARDTAPDFDY
SGDAEVNI