Gene Emin_0749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0749 
Symbol 
ID6263374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp823787 
End bp824992 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content44% 
IMG OID642611224 
Productaminotransferase class I and II 
Protein accessionYP_001875641 
Protein GI187251159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000245467 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000666006 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACTTAA GCAAACTTTC TTCCGTAGTG GCGGATTCCG CCACGCTAAA AATTAACGCA 
AAAGCCAACG CTTTAAAAAA ACAAGGCCTT CCTCTTATTC ATTTGGGCGG AGGCGAGCCT
GAATATCCCG CTCCCAAAGC GGCTGTTGAA GCGATTTTGG CCAAAGCTAA AACGGAAAAA
ATAAAATATT CCCCCACCAC AGGCACCGTG GATTTAAAAG AAGCTGTTAT CAAATATACC
AAAGATAATT ACGGCAAAAC GGTTGAAGCG CAAAATATTA TTATAAGCAG CGGCGCTAAA
CAGGCTATTT ATAATTTTTT ACTTGCTGCG GTTAACCCCG GCGACGAGGT TGTTTTCCCC
GTTCCTTATT GGGTAAGCTA CCCCGAAATG GTTACAATGG TAAGCGGCGT TCCCGTACCG
GTCAAGCCGG CAAACGGTTT AAAAGTTACT TTAGATGAAG TTAAAGCTAA AATAACCCCT
AAAACAAAAG CCGTAATGGT TAACAGCCCG AGCAACCCTT CAGGCATGAT TTTTGACGAA
GCCTTTATTA AAGGCATTGT TGAAACCTGC GAAGAAAAAG GCATTTTCCT TTTGATGGAC
GATATTTACA ACAAACTTGT GTTTGGCGGC GCGGAATGCC CCGTGGCTTT TAAATACGCC
AAAAACGCTG ATAATTTGGT TGTTATTAAC GGCGTAAGTA AACTTTACGG CTTAACCGGG
TTAAGAATAG GCTGGGCTGT AAGCGAAAAT AAAGACCTTA TTGCCGCTAT GGGCCGCATG
CAGGCGCAAA CAACATCATG TAACTGTGAT ATTTCGGAGG CGGCCGCAGC AGCCGTTTTA
AACGGAGACC AAAGCGTGGT AACCGACTTA AAAGCCCAGC TTGAGGAAAA CAGAAACGCT
TTAATGCAGG AGCTTAAACA AATTAAAGAC GTAGGCGTAA CTGTACCTGA AGGCACTTTT
TATACTTTGG TTGATTTTAG GGCTTACGGC AAGAATTCAA TGGAGCTTGC CCAGTTTTTA
CTTGAAAAAG CTTTAGTCGC TGTAGTGCCG GGCGACGCTT TCGGCTTAGA CGGTTACGCC
CGTATAAGCT ACTGCGCGTC TAAAAATAAC GTGGTTGAAG GGGCAAGGCG TATACGCTGG
GCTCTTGATA AAACCTCACC CGACGAAATA ACTATCGGCG GTAAAATTGT TAAAAGGACA
TGGTAA
 
Protein sequence
MNLSKLSSVV ADSATLKINA KANALKKQGL PLIHLGGGEP EYPAPKAAVE AILAKAKTEK 
IKYSPTTGTV DLKEAVIKYT KDNYGKTVEA QNIIISSGAK QAIYNFLLAA VNPGDEVVFP
VPYWVSYPEM VTMVSGVPVP VKPANGLKVT LDEVKAKITP KTKAVMVNSP SNPSGMIFDE
AFIKGIVETC EEKGIFLLMD DIYNKLVFGG AECPVAFKYA KNADNLVVIN GVSKLYGLTG
LRIGWAVSEN KDLIAAMGRM QAQTTSCNCD ISEAAAAAVL NGDQSVVTDL KAQLEENRNA
LMQELKQIKD VGVTVPEGTF YTLVDFRAYG KNSMELAQFL LEKALVAVVP GDAFGLDGYA
RISYCASKNN VVEGARRIRW ALDKTSPDEI TIGGKIVKRT W