Gene Xaut_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_3950 
Symbol 
ID5421141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp4371052 
End bp4373514 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content65% 
IMG OID640883206 
Productarsenite oxidase large subunit 
Protein accessionYP_001418831 
Protein GI154247873 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR02693] arsenite oxidase, large subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.616487 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTACA AGCGCCAGAT CGGCCAATTG CCCATCATTC CCGCCAATGC GACCGTCCAC 
AACGTCGTCT GCCACTATTG CATCGTCGGC TGCGGCTACA AGGCCTATAG CTGGGACGCC
CGATACGAGG GCGGCACGGC GCCGGGCGAG AACGCCTTCG GCGTCGATCT CTCCAGGCAA
CAGCCGGCAG AGACCGCAGC CTGGTATGCG CCGTCCATGT ACAACATCGT CAGGCAGAAC
GGGCGAGACG TGCACATCGT CATCAAGCCC GACAAGGCGT GCGTGGTGAA TTCCGGCCTC
GGCTCCGTGC GCGGCGCGCG GATCGCCGAG ATGAGCTATT CGCGCCAGCG CAACACCCAG
CTCCAGCGCC TCACCGATCC CCAGGTCTGG CGCTACGGAC AGCTCCAGCC CACGAGCTGG
GACGACGCCC TCGACCTCGT CGCCCGCGTC ACCGCAGCGG TGATCGCTGA GCAGGGCGAG
GACGGCCTGT TCGTGTCCGC CTTCGACCAC GGCGGCGCGG GCGGCGGATA CGAGAACACC
TGGGGCACCG GAAAGCTCTA TTTCGGCGCC ATGAAGGTAA AGAACATCCG CATCCACAAT
CGCCCGGCCT ACAATTCGGA GGTCCACGGC TCGCGCGACA TGGGGGTGGG CGAGCTCAAC
AATTGCTACG AGGACGCGGA GCTCGCCGAC ACCCTCGTAG CGGTGGGCAC GAACGCGCTG
GAAACCCAGA CCAACTATTT CCTGAACCAC TGGGTGCCCA ACCTGCGCGG AACCTCCCTC
GACAAGAAGA AGGCGGAGTT CGGCAGCGAG CCGGTGGCCA AGGCGCGCAT CGTCATCGTC
GATCCGCGCC GCACCGTCAC CGTGAATGCC AGCGAGGTGG AGGCGGGCAA GGAGAACGTG
CTCCATCTCG CCCTCAATTC CGGCACCGAC CTGATCCTGT TCAACGCCTG GCTCACCTAC
GCGGCGGAGA AGGGGTGGAT CGACAAGGGT TTCATCGCGG CCTCCACGAA GGACTTCGAC
AAGGCCCTGG CGGCCAACAA GGTGAGCGTG GCGGAAGCCG CTCGCGCCAC CGGCCTCAGC
GAGGCGGACA TCGTCAAGGC AGTCACCTGG ATCGGCGAGC CCAAGGCCGG CGGGGCACGT
CGGCGGACCA TGTTCGCCTA CGAGAAGGGC CTCATCTGGG GCAATGACAA CTACCGCACC
AACCAGGCGC TGGTGAATCT CGCCCTCGCC ACCGGCAATA TCGGGCGTCC GGGCGGCGGC
TGCGTGCGCA TGGGCGGCCA CCAGGAGGGC TATTGCCGCC CGTCCGACGC CCATGTGGGC
CGGCCCGCCG CCTATGTGGA CAAGCTGCTG ATCGAGGGAA AGGGCGGCGT GCACCACATC
TGGGGCTGCG ACCACTACAA GACCACGCTC AACGCCATGG CCTTCAAGCG CGCCTACAAG
ATGCGCACGG ACCTCGTCAA GGACGCCATG GCCAGCGTGC CCTACGGCGA CCGCGATGCC
ATGGTGGCCG CCATCTTGGG CGCCATCCGT AAGGGCGGGC TGTTCAGCGT CGACGTGGAT
ATCGTGCCGA CCCATATCGG CGAGGCCGCC CATGTGATGC TGCCCGCGGC CACCTCCGGC
GAGATGAACC TCACCTCCAT GAACGGCGAG CGCCGCATGC GCCTCACCGA GCGCTACATG
GACCCACCCG GGCAGGCCAT GCCCGACTGC CTCATCGCCG CGCGGATCGC CAACCACATG
GAGCGCGTTC TGCGCGCCAC GGGGAAGGCG GAGGCGGCGG ACAAGTTCAA GGGCTTCGAC
TGGAAGAGCG AGGAAGACGC CTTCATGGAC GGCTACGCCA AGAACGAAAA GGGCGGCGCG
TTCGTCACCT ACGATCGCCT GCGGACCATG GGCACTAACG GCTTCCAGGA GCCCGCCACG
GCCTTCGCAG ACGGCAAGAT CGTCGGCACC AAGCGGCTGT TCGCCGATGG CAAGTTCAAC
AAGCCGGACG GCAAGGCGGT GTTCGCCGAA ACCAGATGGC GCGGGCTCCA GGCCCCCGGC
AAGCAGGCGG AGAAGGACAA GTTCGCCTTC CTCATCAATA ACGGGCGGGC GAACCTCGTC
TGGCAGAGCG CCTATCTTGA CGTGGAGAAT GAGCTGGTCA TGGATCGCTG GCCCTATCCC
TTCATCGAGA TGAACCCGCA GGACATGGCC GAGCTTGGCC TCAAGAGCGG AGATCTTGTG
GAGGTCTACA ACGAGAACGG CTCCACCCAG GCCATGGCCT ATCCCACCCC CACGGCGAAG
CGGAAGGAGA CTTTCATGCT GTTCGGCTTC CCGACCGGCG TGCAGGGCAA TGTGGTGTCC
GCGGGGGTCA ACGAGGACAT CATCCCCAAC TACAAGCAGA CGTGGGGCAA CATCCGAAAG
ATCGCCGACG CACCCGAAGG CGTGCGGCAC CTGACCTTCA AGTCGAAGGA ATATCCCGCC
TGA
 
Protein sequence
MAYKRQIGQL PIIPANATVH NVVCHYCIVG CGYKAYSWDA RYEGGTAPGE NAFGVDLSRQ 
QPAETAAWYA PSMYNIVRQN GRDVHIVIKP DKACVVNSGL GSVRGARIAE MSYSRQRNTQ
LQRLTDPQVW RYGQLQPTSW DDALDLVARV TAAVIAEQGE DGLFVSAFDH GGAGGGYENT
WGTGKLYFGA MKVKNIRIHN RPAYNSEVHG SRDMGVGELN NCYEDAELAD TLVAVGTNAL
ETQTNYFLNH WVPNLRGTSL DKKKAEFGSE PVAKARIVIV DPRRTVTVNA SEVEAGKENV
LHLALNSGTD LILFNAWLTY AAEKGWIDKG FIAASTKDFD KALAANKVSV AEAARATGLS
EADIVKAVTW IGEPKAGGAR RRTMFAYEKG LIWGNDNYRT NQALVNLALA TGNIGRPGGG
CVRMGGHQEG YCRPSDAHVG RPAAYVDKLL IEGKGGVHHI WGCDHYKTTL NAMAFKRAYK
MRTDLVKDAM ASVPYGDRDA MVAAILGAIR KGGLFSVDVD IVPTHIGEAA HVMLPAATSG
EMNLTSMNGE RRMRLTERYM DPPGQAMPDC LIAARIANHM ERVLRATGKA EAADKFKGFD
WKSEEDAFMD GYAKNEKGGA FVTYDRLRTM GTNGFQEPAT AFADGKIVGT KRLFADGKFN
KPDGKAVFAE TRWRGLQAPG KQAEKDKFAF LINNGRANLV WQSAYLDVEN ELVMDRWPYP
FIEMNPQDMA ELGLKSGDLV EVYNENGSTQ AMAYPTPTAK RKETFMLFGF PTGVQGNVVS
AGVNEDIIPN YKQTWGNIRK IADAPEGVRH LTFKSKEYPA