Gene Mlg_0312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0312 
Symbol 
ID4270772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp353409 
End bp354365 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content68% 
IMG OID638125038 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_741157 
Protein GI114319474 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAACG CACTGCTGAA CCGGCAGGTG GTGTTCTTCG CCGGCAAGGG CGGGGTCGGC 
AAGTCCACCT CCGCTGCGGC CTTCGCCCTC TATGCGGCGG ACCAGGACCG GCGGGTGCTG
CTGGTCTCCA CCGACCCGGC ACACAACCTG GCGGACTTGT TTCACACCCC CATCGGCGGC
GAGGGCATCA CCCGCGTCGC CCCCAACCTG GACGCCGTCG AGGTGGACGT CCATCGGGAG
ACCCATCGCT ACCTGGACGG GGTCAAGGAG AATATCCGGC GCACGGTGCG CTCCACCATG
CTGGACGAGG CCCTGCGCCA GATCGACCTG GCCGCGCACT CCCCCGGAGC CGCCGAGGCC
GCCCTGTTCG ACCGTATGGT CAGCCTGATT CTGGAGGAGT CCCAGGCCTA TGACCTGCTG
GTGTTCGACA CCGCCCCCAC CGGGCATACG GTACGGCTGC TCACCCTGCC CGAACTCATG
GGCACCTGGG TGGACGGGCT GCTCAAGCGG CGCCACAAAC GCAACAGGGA CTACTCCCAC
TGGCTGGGCG ACGGCGAGGT CCCGGACGAC CCGCTCTACG ACGTGCTCAG CCGACGCCGC
CAGCGGGCCG CGGCCATGCG TGACATCCTG TTGGATGACC AGACCACCGC CTTTGTCTTT
GTCCTCGTCC CGGAATACCT GCCCATCACC GAGACCCGCA ACGCCATCCG GGAGCTGGCC
GACTGGAACA TCCACGTGCG CCACCTGGTG GTGAACAAGC TCCTGCCTGA AGGCGTCACC
GATCCCTTCT TCCGAGAGCG GCTGGCCCGG GAGCACCGCT GGCTGGCACG TATCGATGAG
TACTTCCCGG ATCTGCACCC GTTGCGCCTG CCGCTGTTGC CAGGCGACGT GGACAGCCGC
GAAGCGCTCG ACCAGGTGGC GCGGGAGATG GCCCGGGCCC TGGACCCGGG GCGCTGA
 
Protein sequence
MDNALLNRQV VFFAGKGGVG KSTSAAAFAL YAADQDRRVL LVSTDPAHNL ADLFHTPIGG 
EGITRVAPNL DAVEVDVHRE THRYLDGVKE NIRRTVRSTM LDEALRQIDL AAHSPGAAEA
ALFDRMVSLI LEESQAYDLL VFDTAPTGHT VRLLTLPELM GTWVDGLLKR RHKRNRDYSH
WLGDGEVPDD PLYDVLSRRR QRAAAMRDIL LDDQTTAFVF VLVPEYLPIT ETRNAIRELA
DWNIHVRHLV VNKLLPEGVT DPFFRERLAR EHRWLARIDE YFPDLHPLRL PLLPGDVDSR
EALDQVAREM ARALDPGR