Gene Mlg_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1823 
Symbol 
ID4268178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2082912 
End bp2084135 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID638126579 
Productaminotransferase 
Protein accessionYP_742657 
Protein GI114320974 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGA GCGATGAGTT TCCCCGCATC AAGCGTCTGC CGCCCTATGT CTTCAATATT 
GTCAACGAGT TGAAGGCGGC GGCCCGTGCC CGCGGGGAGG ACATCGTGGA CTTCGGCATG
GGCAACCCGG ATCAGCCCAC CCCGCAGCAC ATTGTCGACA AATTGACGGA GGTGGCCCAG
CGCGGGGACA CCCACCGTTA CTCCATGTCC CGTGGCATCC CGCGCCTGCG TCGTGCCATT
TGTAACTGGT ACCGCGACCG CTACGATGTG GACCTGGACC GGGAGACCGA GGCCATCGTC
ACGATCGGCT CCAAGGAGGG TCTGGCGCAC CTGGCGCTGG CCACGCTGGC CCCGGGCGAC
GCGGTGCTGG TCCCCAACCC GGCCTACCCC ATTCACCCCT ACGGGGTGGT GATTGCCGGG
GCCGATATCC GGCATGTGCC CATGCTCCCC GACGGCGATT TCTTTGCCGA GATGGAGAAG
GCCATCCGGG ACAGTTATCC CAAGCCCAAG ATGTTGATCC TTAACTTCCC GTCGAACCCC
ACCAGTGCCT GCGTGGACCT GGAGTTCTTC GAGAAGGTGG TGGCGGTGGC GCGTCAGCAC
AACATCTGGG TGGTCCACGA TCTGGCCTAC GCCGATATCG TGTTCGATGG CTACCGGGCG
CCCTCCATCC TTGAAGTGCC GGGGGCGAAG GAGGTGGCCG TCGAGTCCTT CTCGCTGTCG
AAGAGCTACA ACATGCCGGG TTGGCGCGTC GGTTTCATGT GCGGCAACCG CCATCTGATC
GCGGCGCTGG CGCGCATGAA ATCCTACCTG GACTATGGCA CCTTCACGCC CATCCAGGTG
GCAGCCATTG CGGCGCTGGA GGGCCCTCAG GAGTGCGTCC AGGAGATCTG TGAGATGTAC
CGGCGGCGGC GTGACGTGCT CTGTGAAGGC CTTAATGCGG CCGGCTGGGA GGTGGAGAAA
CCCAAGGCCA CCATGTTCGT CTGGGCCCGC ATCCCGGAGC GCTATCGCGA CATGGGCTCG
CTGGAATTCG CCAAAAAGCT GCTGCGGGAT GCCAAAGTGG CGGTCTCGCC GGGGATCGGC
TTCGGTGATT ACGGCGACGA GTACGTGCGC TTCGGGCTGA TTGAGAACGA GCACCGCACG
CGTCAGGCCA TCCGCTGCAT CAAGCAAATG TTCCGTCGGG ACGGTCAGCA CGACCAACAA
CAGGAAGGGG AGGTGAGCTC TTGA
 
Protein sequence
MNLSDEFPRI KRLPPYVFNI VNELKAAARA RGEDIVDFGM GNPDQPTPQH IVDKLTEVAQ 
RGDTHRYSMS RGIPRLRRAI CNWYRDRYDV DLDRETEAIV TIGSKEGLAH LALATLAPGD
AVLVPNPAYP IHPYGVVIAG ADIRHVPMLP DGDFFAEMEK AIRDSYPKPK MLILNFPSNP
TSACVDLEFF EKVVAVARQH NIWVVHDLAY ADIVFDGYRA PSILEVPGAK EVAVESFSLS
KSYNMPGWRV GFMCGNRHLI AALARMKSYL DYGTFTPIQV AAIAALEGPQ ECVQEICEMY
RRRRDVLCEG LNAAGWEVEK PKATMFVWAR IPERYRDMGS LEFAKKLLRD AKVAVSPGIG
FGDYGDEYVR FGLIENEHRT RQAIRCIKQM FRRDGQHDQQ QEGEVSS