Gene Mfla_1491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1491 
Symbol 
ID4000952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1596501 
End bp1598264 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content62% 
IMG OID637938402 
Productarsenite-activated ATPase (arsA) 
Protein accessionYP_545600 
Protein GI91775844 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.738172 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.516588 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAT TCCTGCAACT TCCTCCCCGT TTTCTGTTTT TCACGGGCAA GGGTGGCGTC 
GGCAAGACCT CGATTGCCTG CGCCACGGCC ATTCAGTTAG CCGAAGCCGG AAAGCGCGTC
CTCCTGGTCA GTACCGACCC GGCATCCAAT GTCGGGCAGG TATTTGGTGT CGATATCGGT
AATCGCGTCA CACCGATTCC GGCGGTTCCA CGTCTTTCTG CTCTGGAGAT TGATCCCGAG
GCAGCGGCCA GTGCCTATCG GGAGCGCCTG GTTGGCCCGG TCCGCGGCGT GCTTCCTGAT
GACGTGGTGA AGGGCATCGA GGAATCGTTG TCCGGCGCGT GTACCACCGA AATCGCCGCA
TTTGACGAGT TCACTGCGCT ACTGACCAAC GCGGCACTCA CGGCTGATTA CGAGCACATC
ATCTTTGACA CTGCGCCCAC CGGCCACACC ATCCGCTTGC TGCAACTGCC GGGCGCGTGG
AGCGGTTTCC TGGAAGCTGG CAAGGGTGAT GCCTCGTGCC TAGGCCCGCT GGCCGGTCTG
GAAAAGCAGC GTACTCAGTA CAAGGCGGCT GTTGAAGCCT TGGCTGATCC GCTGCAAACC
CGCCTGGTGC TGGTCGCTCG CGCCCAGCAG GCGGCCTTGC GCGAGGTCGC CCGAACCCAC
GAAGAACTGG CAGCCATAGG CCTCAAACAG CAACATCTCG TCATCAACGG CATCCTGCCG
CACGTCGAAG CCGCTACCGA CCCGCTGGCC GCTGCAATCC ACGAACGGGA ACAAACAGCG
CTGAAGAACA TCCCGGCTAC GTTGACTGCG CTTCCGCGTG ATCACGTAGA ACTTAAGCCC
TTCAATCTCG TCGGCCTTGA AGCACTGCGG CAGTTGCTGA CCGACCTTCC TCCACAAGCA
CCCGCAGCGG TTGATTCCCC GATCGAACTC GACGAGCCCA GCGTGGCCGA GCTGATCGAC
GGCATCGCGG CGGATGGACA CGGGCTGATC ATGTTGATGG GCAAGGGTGG TGTAGGCAAG
ACAACCCTGG CGGCCGCCAT CGCGGTCGAA CTGGCGCATC GCGGCTTACC GGTGCATCTG
ACGACCTCCG ATCCTGCTGC CCACTTGACC GACACACTGG ATTCCTCGCT CGATAATCTG
ACCGTGAGCC GAATCGATCC GCATGCCGAG ACCGAGCGCT ATCGCCAGCA CGTCCTGGAA
ACCAAGGGCG CTCAACTCGA TGCCGAAGGT CGCGCGCTGT TGGAAGAGGA TTTGCGTTCG
CCCTGCACGG AAGAGATTGC TGTCTTCCAG GCGTTCTCCC GCATCATTCG CGAGGCCGGG
AAAAAGTTCG TCGTCATGGA CACGGCCCCG ACCGGGCACA CCTTGCTCCT GCTCGACGCG
ACGGGTGCGT ATCACCGCGA AGTGTCACGA CAAATGGGCA AGACCGGCGT GCACTTCACG
ACGCCGATGA TGCAATTGCA GGACCCGAAG CAAACGAAGG TACTCGTCGT CACGCTGGCG
GAGACGACGC CGGTACTGGA GGCCGCCAAC CTGCAAGCTG ATTTGCGCCG TGCCGGGATC
GAGCCCTGGG CCTGGATCAT CAACACCAGC GTGGCGGCAG CTTCGGCCAA GTCGCCGTTA
CTGCGTCAGC GTGCGGCCAA CGAGCTACGC GAAATCAGCG CTGTGGCGAA TCAGCACGCG
GACCGTTACG CGGTTGTCCC GCTGCTGAAG GAAGAACCGA TCGGTACAGA ACGACTGCGT
GCGCTCATCC ATCCTCAAGC ATAA
 
Protein sequence
MMKFLQLPPR FLFFTGKGGV GKTSIACATA IQLAEAGKRV LLVSTDPASN VGQVFGVDIG 
NRVTPIPAVP RLSALEIDPE AAASAYRERL VGPVRGVLPD DVVKGIEESL SGACTTEIAA
FDEFTALLTN AALTADYEHI IFDTAPTGHT IRLLQLPGAW SGFLEAGKGD ASCLGPLAGL
EKQRTQYKAA VEALADPLQT RLVLVARAQQ AALREVARTH EELAAIGLKQ QHLVINGILP
HVEAATDPLA AAIHEREQTA LKNIPATLTA LPRDHVELKP FNLVGLEALR QLLTDLPPQA
PAAVDSPIEL DEPSVAELID GIAADGHGLI MLMGKGGVGK TTLAAAIAVE LAHRGLPVHL
TTSDPAAHLT DTLDSSLDNL TVSRIDPHAE TERYRQHVLE TKGAQLDAEG RALLEEDLRS
PCTEEIAVFQ AFSRIIREAG KKFVVMDTAP TGHTLLLLDA TGAYHREVSR QMGKTGVHFT
TPMMQLQDPK QTKVLVVTLA ETTPVLEAAN LQADLRRAGI EPWAWIINTS VAAASAKSPL
LRQRAANELR EISAVANQHA DRYAVVPLLK EEPIGTERLR ALIHPQA