Gene Mlg_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2554 
Symbol 
ID4270942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2897032 
End bp2898348 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content69% 
IMG OID638127313 
Productaminotransferase 
Protein accessionYP_743384 
Protein GI114321701 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0255336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0169818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCAAT ACGCTAACCA CCCCGGGCGT CGCAACCACG CCCCGGAGAT CCACCTTAAT 
CTCAACGTCC GCGGGCTGGG CCAGTCCGCC ACCCTCGTCA TCAACGAGCG CAGCGCGGCC
CTGGCCGCGC AGGGGCGCCA CGTCTACCGC TTCGGTCTCG GCCAGTCCCC CTTCCCGGTC
CCGGGGCCGG TGGAGGCCGA GCTCAAGGCC AATGCCCACC AGAAGGACTA CCTGCCGGTG
GAGGGCCTGC GCAACCTGCG CGAGGCGGTG GCCGAGTACC ACCGGCGCAG CCAGGGCGTG
GAGCTGTCGG CCGAGGATGT GCTGATCGGT CCGGGCTCCA AGGAGCTGAT GTTCATCCTG
CAGTTGGTCT ACTACGGCGA CCTGGTCATT CCCACCCCCA GTTGGGTCTC CTACGCCCCC
CAGGCCCACA TCATCGGCCG GCAGATCCGC TGGGTGCAGA CCCGCTACGA GAACGACTGG
CGCCTGCTGC CCGAGGAGTT GGAGAAGCTC TGTGCGGAGG ATCCCTCCCG GCCCCGCATC
CTGATCCTCA ACTACCCGAA CAACCCCACC GGCGAGAGCT ATACCGCGGA CGAGCTGCGG
GGGCTGGCCC GGGTCGCCCG CAAGTACCGG GTGGTCCTGC TCTCGGACGA GATCTACAGC
GAGCTGCACC ACCGGGGCCA GCATGTCTCG GTGGCGCGCT TCTACCCGGA GGGCACCATC
ATCAGCAGCG GGTTGAGCAA GTGGTGCGGG GCGGGGGGGT GGCGGCTGGG TACCTTCGCC
TTCCCCCGCG GGCTGCACTG GTTGCTGGAG GCCATGGCGG TGGTGGCCAG CGAGACCTAC
ACCTCCACCA GCTCTCCGAT CCAGTATGCG GCGGTGCGCG CCTTCCAGGG CGGCCTGGAG
ATCGAACAAT ACCTGCAGCA GTCCCGGCGC GTGTTGCAGG CACTGGGCCG GTACTGCTGG
CGGCGCCTCG ATCAGGCGGG CCTGTCCACG CCCCGTCCGG TTGGCGGGTT CTACCTGTTC
CCCGACTTCA GCCCACAGCG CGAGCGGCTG GTGGCGCGCG GCATCCACAC CGCGCCGGCG
CTCTGCAACC GGCTGCTGCA GGAGACCGGC GTGGCGCTGC TACCGGGCAG TGCGTTCGGG
CGGCCCGAGG CGGAACTGTC GGCCCGGCTC GCCTATGTGG ACTTCGACGG CGCCCGGGTG
CTCACTGCGG CGTCGGCGGA ACCGGCGGGC AAGCTCTCCG AGGAGTTCCT CGAACATTGC
TGTCCCAACG TCGTGGCCGG GATGGAGCGC ATCGTGGATT GGGTGCAGCG CGCCTAG
 
Protein sequence
MPQYANHPGR RNHAPEIHLN LNVRGLGQSA TLVINERSAA LAAQGRHVYR FGLGQSPFPV 
PGPVEAELKA NAHQKDYLPV EGLRNLREAV AEYHRRSQGV ELSAEDVLIG PGSKELMFIL
QLVYYGDLVI PTPSWVSYAP QAHIIGRQIR WVQTRYENDW RLLPEELEKL CAEDPSRPRI
LILNYPNNPT GESYTADELR GLARVARKYR VVLLSDEIYS ELHHRGQHVS VARFYPEGTI
ISSGLSKWCG AGGWRLGTFA FPRGLHWLLE AMAVVASETY TSTSSPIQYA AVRAFQGGLE
IEQYLQQSRR VLQALGRYCW RRLDQAGLST PRPVGGFYLF PDFSPQRERL VARGIHTAPA
LCNRLLQETG VALLPGSAFG RPEAELSARL AYVDFDGARV LTAASAEPAG KLSEEFLEHC
CPNVVAGMER IVDWVQRA