Gene Mlg_1087 details

Gene Information

Locus tag: Mlg_1087
Symbol:
ID: 4270032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1266553
End bp: 1267731
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 64%
IMG OID: 638125839
Product: aminotransferase
Protein accession: YP_741929
Protein GI: 114320246
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
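
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the record does not state either convention). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1266553, 1267731)
print(length, protein_length_aa(length))  # prints "1179 392"
```

Under these assumptions the result agrees with the reported 1179 bp and 392 aa.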


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGATATCA AACTGGCGAA TCGGGTTCAA CGTGTGAAGC CCTCTCCCAC TCTGGCGGTG 
ACCGCCAAGG CGGCGGAACT GCGCGCCGCG GGCAAGGACA TCATCGGCCT GGGGGCCGGC
GAGCCTGATT TCGACACGCC TGAGCATATC AGAGACGCAG CCATAACGGC GATCAACGAG
GGTGAGACCC GCTACACCCC CGTCGACGGC ACTCCGGCGC TGAAGAAGGC CGTGATCAAT
AAGTTCAAGC GCGAGAACGG CCTGGACTAC GATGGCAAGC AGGTGCTGGT CTCCTCCGGC
GCCAAGCACT CGCTGTACAA CCTGATGTGC GCCCTGCTCA ACGAGGGCGA CGAGGTGATC
ATCCCGGCGC CCTACTGGGT GTCCTACCCG GACATGGCCA AGCTGGCCGA CGCCGAGCCG
GTGATCATCG AGGCCGGTCA GGAGCAGGGG TTCAAGATCA CCCCCGAGCA GCTGGAGGGC
GCGATCACTG ACCGCACCCG GCTGTTCGTG ATCAACAGTC CGTCCAATCC CACCGGTTCC
GCCTACAGCA AGGCCGAGCT GGCCGCGCTG GGCGAGGTGT TGAAGAAGCA TCCGCAGATC
GTGGTGGTCA CCGACGATAT CTACGAGCAC ATCCTGTTCG AGGGTGAGTT CGTCAACATC
GTCAACGCCT GCCCGGAGCT GAAGGACCGC ACCGTGGTGG TCAACGGGGT GTCCAAGGCC
TACGCCATGA CCGGCTGGCG GGTCGGTTAT GCCGCGGGCC CCGAGGCGCT GATCGGCGCC
ATGAAGAAGA TCCAGTCCCA ATCGACCTCC AACCCGGCCT CGGTCTCCCA GGCCGCGTCG
GTGGCGGCGC TGGACGGTGA TCAGGGCTGC ATCCCGCCCA TGCTGGAGCA ATTTAAGAAG
CGCCACGACT TCGTGGTGGA CGCCCTGAAC AAGATCGACG GCGTCGAGTG CCGCCCCTGC
GAGGGCACCT TCTACTGCTT CCCCAATATG CAGGGTGCCA TCGACAAGCT GGACGGCGTC
GGCAATGACG TGGAACTGGC CGGGTTCCTG CTGGAGCAGG GTGTGGCCCT GGTGCCCGGC
TCCGCTTTCG GTCTCGAGGG CTATGCCCGG ATCTCCTTCG CCACCAGCAT GGAGAACCTG
GAGAAGGCCA TGGAGCGGAT CGCCAAGGCG CTGGGCTGA
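
The GC content field reported for this gene could be recomputed from a sequence laid out like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks are only formatting. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    clean = "".join(dna.split()).upper()
    if not clean:
        raise ValueError("empty sequence")
    gc = sum(1 for base in clean if base in "GC")
    return 100.0 * gc / len(clean)


# The first 10-base block of the gene sequence: 2 G + 1 C out of 10 bases.
print(gc_percent("TTGGATATCA"))  # prints 30.0
```

Passing the full gene sequence as one string (spaces and newlines included) gives the value to compare against the reported figure.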
 
Protein sequence
MDIKLANRVQ RVKPSPTLAV TAKAAELRAA GKDIIGLGAG EPDFDTPEHI RDAAITAINE 
GETRYTPVDG TPALKKAVIN KFKRENGLDY DGKQVLVSSG AKHSLYNLMC ALLNEGDEVI
IPAPYWVSYP DMAKLADAEP VIIEAGQEQG FKITPEQLEG AITDRTRLFV INSPSNPTGS
AYSKAELAAL GEVLKKHPQI VVVTDDIYEH ILFEGEFVNI VNACPELKDR TVVVNGVSKA
YAMTGWRVGY AAGPEALIGA MKKIQSQSTS NPASVSQAAS VAALDGDQGC IPPMLEQFKK
RHDFVVDALN KIDGVECRPC EGTFYCFPNM QGAIDKLDGV GNDVELAGFL LEQGVALVPG
SAFGLEGYAR ISFATSMENL EKAMERIAKA LG
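
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG can serve as an alternative initiation codon read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The `START_CODONS` set is a deliberately small subset of the initiators table 11 allows, and the function and constant names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by table 11 for elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a common subset of table 11 initiation codons, for illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at '*'."""
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        codon = clean[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("TTGGATATCA AACTGGCGAA TCGGGTTCAA"))  # prints MDIKLANRVQ
```

The output matches the first ten residues of the protein sequence. The final codons of the gene (GCG CTG GGC TGA) translate to ALG followed by a stop, which matches the end of the protein.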