Gene Mlg_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2175 
Symbol 
ID4270954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2472727 
End bp2473713 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content70% 
IMG OID638126931 
Productprotein of unknown function DUF900, hydrolase family protein 
Protein accessionYP_743007 
Protein GI114321324 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.334078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.109087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCACG ACTTTGTGGT CTGCGTCATC AACACCCGTG TCCGCAGCGG CAAGCGGGTC 
TTCGGCCGCG CGCCGGGGCC CACCCGGTTC CTGCTCGTCC CCGATGGCGA GGTGCAGCAA
CCCGCGCACA CCGTGCCCCG CGCCGAGTGG GTGGAGGCGG TCATGGCCGC CGGCACCACC
GGCAGGGACC CCATGTCCGA CAACCCCACC GGCAACGTCC TGGTCTTCAT CCACGGCTAC
AACAACAGCC AGGAGATCGT CATCAAGCGC CACCGCAAGC TCAAGGCGAC GCTGCACGCG
GCCGGCTACC GGGGCACCGT GGTCAGCTTC GACTGGCCCA GCGCCGAGGC CACGCTGCTC
TACATGCGCG ACCGCCGCTA CGCCAAGCAC ACCGCGGAGC GGCTCACCGA CGACTGCATC
AGCCTGTTCT CGACACGCCA GGCGCGCGGC TGTGACCTGA ATGTCCACCT GCTGGGCCAC
TCCACCGGCG CCTACGTCAT CCGCCACGCC TTCGCCGACG CCGACGAGGT CGCCGAGATC
AAGAACCGCC CCTGGAAGGT CAGCCAGATC GCTCTCATCG GTGCCGATGT CTCCAGCAGT
TCCCTGGCCG CCGATGACTC GCGCTTCGTC TCCGTCTACC GCCACTGCTC CCGGCTCACC
AACTACCAGA GCGGCCACGA TGGCGTGCTG CGCGTCTCCA ACGCCAAGCG CATCGGCCTG
CGCGCCCGCG CCGGCCGGGT CGGCCTGCCC GACAACGCCC ACCGCAAGGC GGTGAACGTG
GACTGCAGCC CCTACTTCGC CGGCATCGAC CCGGACAGCC GCACCCCCGG CGAGGACTAC
TTCGGCAACT TCGCCCACTC CTGGCACATC GGCGACCCGC TGTTCGCCCG CGACCTCTGC
CACACCCTGC ACGGCGAACT GGACCGCCAC TCCATCCCCA CCCGGCGCGA GGAGGACGAC
CGGCTGTACC TGCACGACCC CGGCTGA
 
Protein sequence
MSHDFVVCVI NTRVRSGKRV FGRAPGPTRF LLVPDGEVQQ PAHTVPRAEW VEAVMAAGTT 
GRDPMSDNPT GNVLVFIHGY NNSQEIVIKR HRKLKATLHA AGYRGTVVSF DWPSAEATLL
YMRDRRYAKH TAERLTDDCI SLFSTRQARG CDLNVHLLGH STGAYVIRHA FADADEVAEI
KNRPWKVSQI ALIGADVSSS SLAADDSRFV SVYRHCSRLT NYQSGHDGVL RVSNAKRIGL
RARAGRVGLP DNAHRKAVNV DCSPYFAGID PDSRTPGEDY FGNFAHSWHI GDPLFARDLC
HTLHGELDRH SIPTRREEDD RLYLHDPG