Gene Mlg_2524 details


Gene Information

Locus tag: Mlg_2524
Symbol:
ID: 4270163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2866702
End bp: 2867913
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 72%
IMG OID: 638127283
Product: hypothetical protein
Protein accession: YP_743354
Protein GI: 114321671
COG category: [S] Function unknown
COG ID: [COG1565] Uncharacterized conserved protein
TIGRFAM ID:
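
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon (the record lists the gene as not spliced); the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2866702, 2867913)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

Both values match the Gene Length and Protein Length fields of this record.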


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.585114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGGC AACGCCACCC GGCCCTCGAC CTGCCCGAGC CCGATGCCAG CGCCCGCGCC 
CACAGCGAGG CCCTGCAGGC GCGGATCCGC GACGCCATCC GATCCGCCGG CGGCTGGCTG
CCGTTCGACC GCTACATGGG CATGGCCCTG TACGAGCCCG GACTGGGTTA CTACAGCGCC
GGCGCGCCGC GCTTCGGCGA AGGTGGCGAC TTCACCACCG CACCGCTCAT CTCGCCACTT
TTCAGCCGCA CCCTGGCCCA CACCGTACAG CGCGCCCTGC AGGCCCTGGA GCTCGCCACC
GGCCAGGGCG AGGTGCTGGA ACTGGGCGCC GGCAGCGGAC GGATGGCCGC CGACATCCTG
CTGGAGCTGG AGCGGCTGGG GCAGCTTCCC GCCCGCTACC TCATCCTCGA GGTCAGCGCC
GCCCTGCGCC AGGAACAGCA CCGCACCCTG GGTGAACACG CCCCCCACCT GCTCGACCGG
GTGGAGTGGC TGGAACAGCT CCCGGAACAC CCCATTACCG GCGCCCTCCT CGCCAACGAA
GTCCTCGACG CCCTGCCCTT TCGCTGCTTC GAGCGCGGGC GCGACGACAT CCTGGAACGC
GGCGTGGCGC TGGACGACGA CGACCACCCG CAGTGGGCCA CCCGTCCCGC CGATGAGCCC
CTGGCCGGCC ACGTCCGCCA CATCGAGGCC GAGACCGGCC GGCGGCTGCC CCCCGGTTAC
CGCAGCGAGT GCCTGCCGCA ACTGGCCGAT TGGCTGCGCG ACACCACCCG CTGCCTGGCG
CGGGGCCTGG TACTCTACAT CGATTACGGC TACCCCCGGC GCGAGTACTA CCTGCCCGAC
CGCCACATGG GCACCCTGCT CTGCCACTAC CGCCACCGCG CCCACGAGGA CCCTTTCCTC
TGGCCCGGGC TGCAGGACAT CACCGCCTTC GTCGACTTCA CGGCCGTGGC CGAGGCGGCA
CTGGCCGCCG ACCTGGACGT GCTCGGCTTC ACCAGCCAGG CCCAATACCT GCTCGCCGCC
GGCCTGGCGC ACCTGGCCGA CGAGGCCATG GCGCAGCACG ACGACGACAT GCACCGCCTT
CAGATCGCGC AACAGGTCCG CCGCCTCACC CTGCCCTCCG AACTGGGCGA GCGCTTCAAG
GTCCTGCCCC TGGGCCGCGA CCTGGCCCCC CTGCCGGAAT TCATCCGCAC CGACCAGCGC
CACCGCCTTT GA
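
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before the Gene Length and GC content fields can be recomputed. The following is a minimal sketch of that step; `gc_stats` is a hypothetical helper name, and the example call uses only the first two blocks of the listing rather than the full sequence.

```python
def gc_stats(blocked_dna: str) -> tuple[int, float]:
    """Return (length in bp, GC fraction) for a whitespace-blocked DNA listing."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    dna = "".join(blocked_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return len(dna), gc_count / len(dna)


# First two blocks of the gene sequence, as printed in the listing.
length, gc_fraction = gc_stats("ATGACCCGGC AACGCCACCC")
print(length, gc_fraction)
```

Passing the complete listing to the same function should give values comparable to the 1212 bp and 72% reported in the Gene Information section.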
 
Protein sequence
MTRQRHPALD LPEPDASARA HSEALQARIR DAIRSAGGWL PFDRYMGMAL YEPGLGYYSA 
GAPRFGEGGD FTTAPLISPL FSRTLAHTVQ RALQALELAT GQGEVLELGA GSGRMAADIL
LELERLGQLP ARYLILEVSA ALRQEQHRTL GEHAPHLLDR VEWLEQLPEH PITGALLANE
VLDALPFRCF ERGRDDILER GVALDDDDHP QWATRPADEP LAGHVRHIEA ETGRRLPPGY
RSECLPQLAD WLRDTTRCLA RGLVLYIDYG YPRREYYLPD RHMGTLLCHY RHRAHEDPFL
WPGLQDITAF VDFTAVAEAA LAADLDVLGF TSQAQYLLAA GLAHLADEAM AQHDDDMHRL
QIAQQVRRLT LPSELGERFK VLPLGRDLAP LPEFIRTDQR HRL
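
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG ordering; translation table 11 uses the same codon-to-residue assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The name `translate` is illustrative, and the function does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Residues for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a whitespace-blocked DNA listing, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACCCGGC AACGCCACCC GGCCCTCGAC"))  # MTRQRHPALD
# The final twelve bases give the last three residues followed by the stop codon.
print(translate("CACCGCCTTT GA"))  # HRL
```

Both fragments agree with the start (MTRQRHPALD) and end (HRL) of the protein sequence listed above.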