Gene M446_3580 details

Gene Information

Locus tag: M446_3580
Symbol:
ID: 6134378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 3996692
End bp: 3997903
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 78%
IMG OID: 641643747
Product: homocitrate synthase
Protein accession: YP_001770395
Protein GI: 170741740
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
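
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3996692
end_bp = 3997903

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1212
print(protein_length)  # 403
```

Both results match the listed Gene Length (1212 bp) and Protein Length (403 aa).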


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.483441
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0700115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAC CGTCCTCCGC CTCCCCTCCC CCCGCACCGT CCGGGCCGCC CTCCGCCCGG 
ACCGTGTTCC TCAACGACAC CACCCTGCGC GACGGCGAGC AGGCCCCGGG CGTCGCCTTC
ACCCGCCGCG AGAAGATCGA GATCGCCGAG GCCCTCGCGG CGGCCGGGGT CCCGGAGATC
GAGGCCGGCA CGCCGGCCAT GGGCGAGGAC GAGATCGAGA CGATCCGCTC CATCGTCTCG
CTGCGGCTGC CCCTGCGGGT GATCGCGTGG TGCCGGATGC GCGAGGACGA CCTCCTCGCC
GCGGTCGCCG CGGGCGTGCC GGCGGTCAAC CACTCGATCC CGGTCTCGGA CGCCCAGCTG
CGCGGCAAGC TCGGCCGCGA CCGCGCCTTC GCCCTCGACG CGGTCGCCGC GACGGTGGCG
CGCGCGCGGC GCCTCGGCCT CGCGGTGGCG GTCGGCGCCG AGGACGCCTC GCGCGCCGAT
CCCGACTTCC TCTGCCGCGT CGCCGAAGCG GCGCGGGCGG CGGGCGCCGA GCGCCTGCGC
CTCGCCGACA CGCTCGGCGT GCTCGATCCC TTCGCCGCCG ACGCCCTGGT CCGGCGCCTC
GCCGCGGCCA CCGACCTCGC CCTCGAATTC CACGCCCACG ACTATCTCGG CCTCGCCACC
GCCAACACGC TGGCGGCGCT GCGGGCGGGG GCGCGCCACG CCAGCGTCAC CGTGACGGGG
CTCGGCGAGC GGGCCGGCAA TGCCGCCCTG GAGGAGGTGG CGGTGGCGCT GGCGCGGTTC
GGCCAGGGGC CGACCGGGAT CGACCTTCGC GCGCTGCGCC CGCTCGCCGC CGCCGTCGCG
GCGGCGGCCG AGCGTCCCCT GCCGCGCGGC AAGGCCATCG TGGGCGAGGA CATCTTCACC
CACGAATCCG GCATCCACGT CGCCGGGCTG CTGCGGGACC GGGCGACCTA CGAGGCGCTC
GATCCCGGGA TGCTCGGGCG CAGCCACCGC ATCGTGATCG GCAAGCATTC GGGGGTGGCG
GCGCTCGCCA GCGCCCTCGC GGCGCAGGGG CGCAGCCTCG ACGCGGAGGT CGCCCGCGAC
CTCCTGGAAC GGGTCCGGGC GGCGGCGGTG CGCACCAAGG CGGCGGTGCC GCCGGGCCTG
CTGCGGCGCC TCCACGACGA GTGCCTGATG AGCGCGCGGC CGCTGCCGCG CTTCGCCGCC
GCGGGGAGCT GA
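
The gene sequence is printed in blocks of 10 bases, so it has to be stripped of whitespace before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content listed in the gene information. The helper names `clean_sequence` and `gc_content` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First line of the gene sequence, pasted with its 10-base blocks.
first_line = "ATGCCGGAAC CGTCCTCCGC CTCCCCTCCC CCCGCACCGT CCGGGCCGCC CTCCGCCCGG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_content(seq))  # GC percentage of this first line only
```

Pasting all lines of the gene sequence into `clean_sequence` gives the input for a whole-gene value.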
 
Protein sequence
MPEPSSASPP PAPSGPPSAR TVFLNDTTLR DGEQAPGVAF TRREKIEIAE ALAAAGVPEI 
EAGTPAMGED EIETIRSIVS LRLPLRVIAW CRMREDDLLA AVAAGVPAVN HSIPVSDAQL
RGKLGRDRAF ALDAVAATVA RARRLGLAVA VGAEDASRAD PDFLCRVAEA ARAAGAERLR
LADTLGVLDP FAADALVRRL AAATDLALEF HAHDYLGLAT ANTLAALRAG ARHASVTVTG
LGERAGNAAL EEVAVALARF GQGPTGIDLR ALRPLAAAVA AAAERPLPRG KAIVGEDIFT
HESGIHVAGL LRDRATYEAL DPGMLGRSHR IVIGKHSGVA ALASALAAQG RSLDAEVARD
LLERVRAAAV RTKAAVPPGL LRRLHDECLM SARPLPRFAA AGS
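
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the translation can be reproduced with a plain codon lookup. Table 11 assigns the same amino acids to codons as the standard genetic code and differs in its permitted start codons. This sketch ignores alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCGGAAC CG"))  # MPEP
print(translate("GCGGGGAGCT GA"))  # AGS
```

The two calls use the first 12 and last 12 bases of the gene sequence; their output matches the start (MPEP) and end (AGS) of the protein sequence.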