Gene Emin_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0401 
Symbol 
ID6262478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp428128 
End bp429378 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content45% 
IMG OID642610868 
Productpyruvate carboxyltransferase 
Protein accessionYP_001875295 
Protein GI187250813 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.3494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATA AACTTATCTA TATAGTAGAC GTTACCAACC GGGACGGGGT GCAAACTTCA 
AGGCTCGGTC TTGCTAAGCT GCAAAAAACG CTTGTAAATA TTTACTTAGA CGAAATGGGG
ATTACCCAGT CTGAATTTGG TTTTCCTACC ACTCAGCATG AAATTAATTA CCTTAACGCT
AATTTAGACT TGGCTGAACG AGGCGGCATA AAAAAAACCG TTTTAAGCGG CTGGATGCGC
GCGCTTGAGA GCGATGTTGT TTTGGCTTTT AAAAACGTTC CTAAATTAAA AACGGTGAAC
TTATCTATTT CCACTTCAGA CCAAATGATA CAAGGAAAAT TCGGCGGGCG CAAAACAAGA
AAGGATATTA TAAACCAAAT GACCGCCGCC GTAAAAAAAG CTTATGAATG CGGCGCGGAG
CTTGTGGGGG TAAACTCGGA AGATGCTTCC AGAACGGACC AGGAATACTT GGTTGAGTTT
GCTTTAGCCG CTAAAGCCGC TGGCGCAAAG AGATTAAGAT ACTGCGACAC TTTGGGCTAT
GATTCGCCAG ATGTGGCATA TTCCAGATTA AATGATTTGG CGAAAAGAAC GTGTTTGGAC
CTGGAAATGC ATTTTCATAA TGACCTGGGC ATGGCTGTAG CCAACTCCGT ACGCGGGGCG
CAGGGCGCTA TTGATGGCGG CGTTAACGCT TATATAAACA CGGCGGTTAA CGGCATGGGC
GAAAGGTCCG GCAACGCCGA TTTGGCATGT TGCGTTTTGG CTGTTTTAAA ATCAAGCGGG
TTTTCCGGTA AATATAAAAT TGACCCTGAT ATCGATATGT CAAAAATATG GCCGCTTGCC
AAATACACAT CTTACTCATT TGGCGTACCT ATACCTATTA ACTACCCCGC GGTGGGTGGC
AACGCTTTCG CGCATGAGTC TGGTATTCAC GCGGACGGCG CTTTAAAAGA CCGGCGTAAT
TATGAACTTT ATGATTATGA GGAGTTAGGC CGCGGCGAAC CTGAAGTTAT TGAAACAGGC
CGCATGATTA CCACGGGGGA ATACGGCGGT ATTAAAGGTT TTAGGGACGT GTATGAAAAA
CTGTCCATAG AATTTAAAGA CGAAAAAGAA GCCAGAAATA TTTTGGAACT TGTGCGCTAC
GCCAACGTTC ACACGCAGCT TCCTTTAATG GACAGCGAAC TTAGGTTTAT TTACAATTAC
CCGGACTTGG CGGCGCTTAT TATGACAACT ACGCCGCAGT ATGAAAAATA A
 
Protein sequence
MSDKLIYIVD VTNRDGVQTS RLGLAKLQKT LVNIYLDEMG ITQSEFGFPT TQHEINYLNA 
NLDLAERGGI KKTVLSGWMR ALESDVVLAF KNVPKLKTVN LSISTSDQMI QGKFGGRKTR
KDIINQMTAA VKKAYECGAE LVGVNSEDAS RTDQEYLVEF ALAAKAAGAK RLRYCDTLGY
DSPDVAYSRL NDLAKRTCLD LEMHFHNDLG MAVANSVRGA QGAIDGGVNA YINTAVNGMG
ERSGNADLAC CVLAVLKSSG FSGKYKIDPD IDMSKIWPLA KYTSYSFGVP IPINYPAVGG
NAFAHESGIH ADGALKDRRN YELYDYEELG RGEPEVIETG RMITTGEYGG IKGFRDVYEK
LSIEFKDEKE ARNILELVRY ANVHTQLPLM DSELRFIYNY PDLAALIMTT TPQYEK