Gene Information |
Locus tag | Mlg_0312 |
Symbol | |
ID | 4270772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 353409 |
End bp | 354365 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125038 |
Product | arsenite-activated ATPase ArsA |
Protein accession | YP_741157 |
Protein GI | 114319474 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
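
The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the record above.
START_BP = 353409
END_BP = 354365


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 957
    print(protein_length(length))  # 318
```

Both results match the Gene Length (957 bp) and Protein Length (318 aa) fields of the record.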
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAACG CACTGCTGAA CCGGCAGGTG GTGTTCTTCG CCGGCAAGGG CGGGGTCGGC AAGTCCACCT CCGCTGCGGC CTTCGCCCTC TATGCGGCGG ACCAGGACCG GCGGGTGCTG CTGGTCTCCA CCGACCCGGC ACACAACCTG GCGGACTTGT TTCACACCCC CATCGGCGGC GAGGGCATCA CCCGCGTCGC CCCCAACCTG GACGCCGTCG AGGTGGACGT CCATCGGGAG ACCCATCGCT ACCTGGACGG GGTCAAGGAG AATATCCGGC GCACGGTGCG CTCCACCATG CTGGACGAGG CCCTGCGCCA GATCGACCTG GCCGCGCACT CCCCCGGAGC CGCCGAGGCC GCCCTGTTCG ACCGTATGGT CAGCCTGATT CTGGAGGAGT CCCAGGCCTA TGACCTGCTG GTGTTCGACA CCGCCCCCAC CGGGCATACG GTACGGCTGC TCACCCTGCC CGAACTCATG GGCACCTGGG TGGACGGGCT GCTCAAGCGG CGCCACAAAC GCAACAGGGA CTACTCCCAC TGGCTGGGCG ACGGCGAGGT CCCGGACGAC CCGCTCTACG ACGTGCTCAG CCGACGCCGC CAGCGGGCCG CGGCCATGCG TGACATCCTG TTGGATGACC AGACCACCGC CTTTGTCTTT GTCCTCGTCC CGGAATACCT GCCCATCACC GAGACCCGCA ACGCCATCCG GGAGCTGGCC GACTGGAACA TCCACGTGCG CCACCTGGTG GTGAACAAGC TCCTGCCTGA AGGCGTCACC GATCCCTTCT TCCGAGAGCG GCTGGCCCGG GAGCACCGCT GGCTGGCACG TATCGATGAG TACTTCCCGG ATCTGCACCC GTTGCGCCTG CCGCTGTTGC CAGGCGACGT GGACAGCCGC GAAGCGCTCG ACCAGGTGGC GCGGGAGATG GCCCGGGCCC TGGACCCGGG GCGCTGA
|
Protein sequence | MDNALLNRQV VFFAGKGGVG KSTSAAAFAL YAADQDRRVL LVSTDPAHNL ADLFHTPIGG EGITRVAPNL DAVEVDVHRE THRYLDGVKE NIRRTVRSTM LDEALRQIDL AAHSPGAAEA ALFDRMVSLI LEESQAYDLL VFDTAPTGHT VRLLTLPELM GTWVDGLLKR RHKRNRDYSH WLGDGEVPDD PLYDVLSRRR QRAAAMRDIL LDDQTTAFVF VLVPEYLPIT ETRNAIRELA DWNIHVRHLV VNKLLPEGVT DPFFRERLAR EHRWLARIDE YFPDLHPLRL PLLPGDVDSR EALDQVAREM ARALDPGR
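
The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for all 64 codons (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). Whitespace is stripped so that sequences can be pasted in the 10-base groups used above.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. 10-base grouping) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown above.
    prefix = "ATGGATAACG CACTGCTGAA CCGGCAGGTG"
    print(translate(prefix))  # MDNALLNRQV
```

Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content fields to be checked in the same way. The sequence is listed in coding orientation (it begins with ATG) even though the gene lies on the minus strand, so no reverse complement is needed before translating.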