Gene Mlg_1417 details

Gene Information

Locus tag: Mlg_1417
Symbol:
ID: 4270415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1622901
End bp: 1623917
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 70%
IMG OID: 638126173
Product: aminodeoxychorismate lyase
Protein accession: YP_742256
Protein GI: 114320573
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
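
The start, end, and length values in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length (neither convention is stated in the record); the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1622901
end_bp = 1623917

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of
# the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1017 0 338
```

Under these assumptions the computed values agree with the reported gene length (1017 bp) and protein length (338 aa).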


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.626218
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.75601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTGGG CCCGTCTCAC GCTGCTGACG ACGCTGCTCT TGGTGATGGC CGCCCTGGCG 
GTCGGTGCCT GGGCCTGGCA GGCCTGGGAC CGGCTCACTG CGCCTATCAC AGCGGACGGG
GAGTCGGTGG TCATCGAGAT CCCCCGGGGC GCCTCCTTCC GCCAAGTGGT TGAGCGGCTG
GAGCGGGAGA CGGCCTTCGA GGATGGCCTG GCCCTGCGGC TGTACGCCCG CTATACCGGC
GACGACGCCC GGGTCCAGGC GGGCGAGTAC GCCCTGGAGC CGGGCATCAG CGTGCTGGAT
GCCCTGGAGC GGTTCGCCCG AGGCGAGGTC GTCCAGCACC GCATCACCGT GGTCGAGGGC
CTCACCTTCC GCCAGATGCG GCGTCTCATC GAAGCCCACC CGGCCCTGGA GCAGACCCTT
AAGGGGCTGG ACGATGAGGG GGTGATGGCC GAGCTGGGCA AGCCGGATCG TCACCCGGAG
GGCTGGTTCT ACCCCAGTAC CTACACCTTC CCCCGCGGGA CCACCGACCG TGACCTGCTG
GCCCGCGCCA TGCGCCGCAT GGAGCGCCGC CTGGAGGAGG AGTGGGCGGC GCGGGCCGAC
GGACTGCCCC TGGAGACGCC CTACGAGGCG CTGATTCTGG CCTCCATCAT CGAGCGCGAG
ACCGGGCGGG ACGGGGAGCG GGCGAAGGTG GCCGGCGTCT TCACCCGGCG GCTGGAAAAG
GGCATGCGCC TGCAGACCGA CCCGACGGTC ATCTACGGCA TGGGTGAGGC CTATGACGGG
CGCATACGCA GCGCCGATCT GCGCCGGGAC ACGCCTTATA ACACCTACAC CCGCCACGGC
CTGCCCCCGA CGCCCATTGC CATGCCCGGC AGCGCCTCGA TCCGCGCGGC CGTGAACCCG
GCGGACCACG ACTACCTCTA CTTCGTCTCG CGCGGCGACG GCAGCCACCA ATTCTCCCGC
ACCCTGGCGG AACACAACCG TGCCGTGCGC CGCTACATTC TGGGGGAGGG CGAATGA
 
Protein sequence
MNWARLTLLT TLLLVMAALA VGAWAWQAWD RLTAPITADG ESVVIEIPRG ASFRQVVERL 
ERETAFEDGL ALRLYARYTG DDARVQAGEY ALEPGISVLD ALERFARGEV VQHRITVVEG
LTFRQMRRLI EAHPALEQTL KGLDDEGVMA ELGKPDRHPE GWFYPSTYTF PRGTTDRDLL
ARAMRRMERR LEEEWAARAD GLPLETPYEA LILASIIERE TGRDGERAKV AGVFTRRLEK
GMRLQTDPTV IYGMGEAYDG RIRSADLRRD TPYNTYTRHG LPPTPIAMPG SASIRAAVNP
ADHDYLYFVS RGDGSHQFSR TLAEHNRAVR RYILGEGE
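
The gene and protein sequences above are printed in 10-character blocks, so they need to be joined before any analysis. The sketch below is one possible way to strip the block formatting, translate the gene sequence, and compute a GC percentage. It is an illustration, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard one, whose sense codons are shared by translation table 11. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: only sense and stop codons matter here; translation table 11
# differs from the standard code only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACTGGG CCCGTCTCAC GCTGCTGACG ACGCTGCTCT TGGTGATGGC CGCCCTGGCG"
print(translate(first_line))  # prints: MNWARLTLLTTLLLVMAALA
```

The printed peptide matches the first 20 residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.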