Gene Mlg_2250 details

Gene Information

Locus tag: Mlg_2250
Symbol:
ID: 4269191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2549950
End bp: 2551458
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 69%
IMG OID: 638127007
Product: anthranilate synthase component I
Protein accession: YP_743082
Protein GI: 114321399
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
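
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2549950
END_BP = 2551458
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, ASSUMING the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


span = span_length(START_BP, END_BP)
print(span == GENE_LENGTH_BP)                                  # True
print(expected_protein_length(span) == PROTEIN_LENGTH_AA)      # True
```

Under these assumptions the three fields agree: 2551458 - 2549950 + 1 = 1509 bp, which is 503 codons, or 502 residues plus one stop codon.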


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.327103
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACCGG AACAACTGCG TGAACTGGCC GAGGCCGGCT ACAACCGCGT CCCGCTGGTG 
CGGGAGATCC TCGCCGATCT GGACACCCCG CTGTCCACCT ACCTGAAGCT GGCCCAGGGG
CCCTATTCCT ATTTGTTCGA GTCGGTCCAG GGCGGTGAGA AGTGGGGCCG TTATTCCATC
ATCGGCCTGC CTTGCCGCAC GGTGCTGCGG GTATGGGGCC AGGAGGTGGA GGTCAGCGAG
GATGACCGGG TGGTGGAACG CATCCCGGTG GACGATCCGC TGGCCTGGAT CGAGGCCTAC
CAGGCCCGCT TCAAGGCGGC CGATCCGCCC GGCTTTCCGC AGATGCCCCG CTTTCTCGGC
GGGCTGGTGG GCTACTTCGG CTACGAGACC GTGGGGTATA TCGAGCCGCG CCTGGCCGGC
CGGGCCAACC GCGACGAGCT GGAGGTGCCC GACATCCTGT TGATGGTCAG CGAGGAGGTG
CTGGTCTTCG ACAACCTGGC CGGCCGCCTC TACCTGGTGG TGCAGGTGGA CCCGGCCGCC
GAGGGGGCGC TGACACGCGG GCAGGCGCGC CTGGAGGAGC TGGCCGAGCG GCTGCGCCGC
GCCGACTCGG TCTACCAGCG GCGCCGTCCT CCACGGCGGG TGCTGGAGTC GGACTTTCGC
TCGAACTTCA CCCAGCCGGA GTTTGAGGCG GTGGTGGAGC GTATCCGCGA GTACATCCTG
GCCGGCGACT GCATGCAGGT GGTGCCCTCC CAGCGCATGT CGGTGCCCTA CCGGGCGGAG
CCGCTGGACC TCTACCGGGC GCTGCGCTGC ACCAATCCCT CCCCCTATAT GTATTACCTG
GACCTGGGCG ATTTCCACGT GGCCGGCTCG TCGCCGGAGA TCCTGGTGCG GCTGGAGGAC
GACCAGGTGA CGGTGCGGCC CATCGCCGGT ACCCGACGCC GTGGCCACGA CGAGGCCGAG
GACCTGGCCC TGGAGGCCGA TCTGGTCAGT GACCCCAAGG AGCTGGCTGA GCACCTGATG
CTCATCGACC TGGGGCGCAA CGACGTGGGC CGGGTGGCGG ACACCGGCAG CGTGCGGGTC
ACCGAGCGGA TGGTGGTGGA GCGCTACTCC CACGTCATGC ATATCGTCTC CAACGTCACC
GGGCGGCTGC GCCCCGGGCA GGGGCCCATG GAGGTGCTGC GCGCTACCTT CCCGGCGGGT
ACGGTGAGCG GGGCGCCGAA GATACGGGCC ATGGAGATCA TCGCCGAGGT GGAGCCGGTC
AAGCGGGGGG TGTACTCGGG GGCGGTGGGG TACCTGTCCT GGTCCGGCAA CCTGGATACC
GCCATCGCCA TCCGCACCGC GGTGATCAAG GACGGCCGGG TGTACGTGCA GGCGGGCGCT
GGGGTGGTGG CCGACTCGGT GCCGCGGCTG GAGTGGAAGG AGACGCTGAA CAAGGGCCGC
GCCCTGTTCC GCGCCGTGGA GATGGCCGAG CACGGCCTGG ATAACCCACC GGCGGTGGAG
GCGCACTGA
 
Protein sequence
MQPEQLRELA EAGYNRVPLV REILADLDTP LSTYLKLAQG PYSYLFESVQ GGEKWGRYSI 
IGLPCRTVLR VWGQEVEVSE DDRVVERIPV DDPLAWIEAY QARFKAADPP GFPQMPRFLG
GLVGYFGYET VGYIEPRLAG RANRDELEVP DILLMVSEEV LVFDNLAGRL YLVVQVDPAA
EGALTRGQAR LEELAERLRR ADSVYQRRRP PRRVLESDFR SNFTQPEFEA VVERIREYIL
AGDCMQVVPS QRMSVPYRAE PLDLYRALRC TNPSPYMYYL DLGDFHVAGS SPEILVRLED
DQVTVRPIAG TRRRGHDEAE DLALEADLVS DPKELAEHLM LIDLGRNDVG RVADTGSVRV
TERMVVERYS HVMHIVSNVT GRLRPGQGPM EVLRATFPAG TVSGAPKIRA MEIIAEVEPV
KRGVYSGAVG YLSWSGNLDT AIAIRTAVIK DGRVYVQAGA GVVADSVPRL EWKETLNKGR
ALFRAVEMAE HGLDNPPAVE AH
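
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared against the protein sequence. It is not the database's own method. The helper names are hypothetical, and the codon table is built by hand; translation table 11 assigns the same amino acids to codons as the standard code, and this sketch does not handle alternative start codons (not needed here, since the gene starts with ATG).

```python
# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGCAACCGG AACAACTGCG TGAACTGGCC GAGGCCGGCT ACAACCGCGT CCCGCTGGTG"
last_line = "GCGCACTGA"

print(translate(clean(first_line)))  # MQPEQLRELAEAGYNRVPLV
print(translate(clean(last_line)))   # AH
```

Both outputs match the start and end of the protein sequence listed above. To check the whole record, pass the full gene sequence text to `clean` and compare `translate` of the result with the cleaned protein sequence; `gc_fraction` on the full sequence can be compared with the reported GC content of 69%.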