Gene Mlg_2214 details


Gene Information

Locus tag: Mlg_2214
Symbol:
ID: 4268686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2520199
End bp: 2521296
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 70%
IMG OID: 638126970
Product: histidinol-phosphate aminotransferase
Protein accession: YP_743046
Protein GI: 114321363
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
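
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2520199
end_bp = 2521296
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS is made of whole codons; the last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span)                     # 1098
print(codons, remainder)        # 366 0
print(expected_protein_length)  # 365
```

Under those assumptions the three figures agree: 1098 bp is 366 codons, and dropping the stop codon leaves 365 residues.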


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.904868
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCGTT CCTCACGCAG CGCCCTGATC GACGCTTGGG TGCGCCCGGA GGTGCGGGCG 
CTGAGCGCCT ACCACGTGCC GCCACCGGCG GACGTGATCA AGCTGGACGC CATGGAGAAC
CCCTGGCCCT GGCCCGGGGA ACTGGTCGAG GCCTGGCAGG CAGCGCTGTC GGAGGTGCCG
CTCAACCGCT ATCCCGACGC CGGGGCCGGC GAACTGAAGG CCCGGCTGCG CGCGGCCATG
GGCATCCCGG AGGGGGCCGC GCTCATGCTC GGCAACGGCT CGGATGAGTT GATCCTGCTC
ATCAACCTCC TGGTGGCGGG GCCCGGCCGG GTCGTGCTCA CCCCGGGGCC GGGTTTCGCC
ATGTACCGCA TCATCGCGGT CAACAGCGGC CTGGGCTACC GCGAGGTGCC GCTGGCCGAG
GACTTTGAGC TGGACGAGAC GGCCATGCGG GCGGCGCTGC GGGAGGTGCA GCCGGCGGTG
GTCTATATTG CCTATCCCAA CAACCCCACC GGCAATGCCT TTAATCGCGA CGCCCTGGCC
CGCGTGATCC ACGAGGCCCC AGGCCTGGTG GTGGTGGACG AGGCCTACTA CGCCTTCGCC
AACGACAGCT TCCTGCCCGA TGTGCTGGCC TATCCCAACC TGCTGGTGAT GCGCACCGTC
TCCAAGGTGG GGCTGGCCGG CCTGCGCCTG GGCCTGGTGG CCGGCCACCC GGACTGGCTT
GAGGAACTGG AGAAACTGCG CCTGCCCTAC AACATCAGTG CCCTTACCCA GGCCAGTGCC
AACTTTGCGC TGGATCACCG CGAGCAACTC GAGCGCAGCG TGGCCGCCAT CGTGGCCGAG
CGCGGGCGTT TGGCCGAAGG GCTGGCGGCC ATTCCCGGGG TGGAACGGGT CTGGCCCAGC
GAGGCCAACT TCCTCACCTT CCGGGGCCCG GCCGGGGCCG CCGGCCGGGT GCACAGCGGG
CTGCGGGAGC ACGGCGTGCT GATCAAGAAC CTGGACGGTA GTCACCCGCA GCTCGCCGAC
TGCCTGCGGG TGACGGTGGG CCGGCCGGAG GAAAACGCCC GCTTCCTGGC GGCGCTGGGC
CAGGTGCTGT CAGGGTAA
 
Protein sequence
MSRSSRSALI DAWVRPEVRA LSAYHVPPPA DVIKLDAMEN PWPWPGELVE AWQAALSEVP 
LNRYPDAGAG ELKARLRAAM GIPEGAALML GNGSDELILL INLLVAGPGR VVLTPGPGFA
MYRIIAVNSG LGYREVPLAE DFELDETAMR AALREVQPAV VYIAYPNNPT GNAFNRDALA
RVIHEAPGLV VVDEAYYAFA NDSFLPDVLA YPNLLVMRTV SKVGLAGLRL GLVAGHPDWL
EELEKLRLPY NISALTQASA NFALDHREQL ERSVAAIVAE RGRLAEGLAA IPGVERVWPS
EANFLTFRGP AGAAGRVHSG LREHGVLIKN LDGSHPQLAD CLRVTVGRPE ENARFLAALG
QVLSG
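
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) allows: GTG is one of its alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that rule, not the tool that produced this record. The function name `translate_cds` is hypothetical, the amino acid assignments are those of the standard code (which table 11 shares), and the start codon set reflects the NCBI listing for table 11 as far as I am aware. Only the first 60 bases of the gene sequence are used.

```python
# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an allowed start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon codes for methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":              # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence in this record (60 bases, 20 codons).
first_line = "GTGAGCCGTT CCTCACGCAG CGCCCTGATC GACGCTTGGG TGCGCCCGGA GGTGCGGGCG"
print(translate_cds(first_line))  # MSRSSRSALIDAWVRPEVRA
```

The output matches the first 20 residues of the protein sequence above. Note that the internal GTG codons in the same stretch are still read as valine; only the initiating codon is treated specially.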