Gene Arth_3407 details

Gene Information

Locus tag: Arth_3407
Symbol:
ID: 4444137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3832711
End bp: 3834288
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 68%
IMG OID: 639691231
Product: oxidoreductase, molybdopterin binding
Protein accession: YP_832882
Protein GI: 116671949
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
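
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed 1578 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3832711
END_BP = 3834288


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1578 525
```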


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGC TTACGAACTG GCTCAAGGGT CCCACTGCGC TGGCCGCGCT GGCAGGCGTG 
GCTGCGGCCG CCGTCGTACT TTCCGTTGCG GAACTGATCG GCGCCTTCTT CACGGCGCGG
GCAACGCCGC TCATTGCCCT CGGCTCAACC TTTATCGACT TCACGCCGCC GTGGCTGAAG
GATTTCGCCA TTGCCACGTT CGGCACCAAT GACAAGGCGG CACTGTTCGC GGGGATGGGA
CTGACCATCT TCCTGCTCGC CTGCGTGCTC GGCGTCGTGG CGTACCGGAA GTGGTCGCTG
GGCGTGGCCG GCGTCCTCCT GATGGGCGCG GTCATTGTTG CCAGCGTTGT GACCCGGGCC
AGCGTGGAAC CGCTGGACGC CATCCCTTCG CTGATCGGCA CCGCGGCCGG GCTGGTGGTG
CTGCGGCTGC TGATCACGCG GCTGTGGCGG ATGCGTTCAT GGCCCGGCGT CGCCGCGGAT
GTCGCCGCCA AGGACACCGA ACGCCCGGCC ACCACCCGCC GCGCCTTCTT CGCCGCCAGC
GGGATCACCG CAGTGGCGGC TGCCATCGCG GCAACCGGCG GCCGGCTGCT CAGCGCGGCA
CGGAGCAATA TTGCCCAGGC CCGCGAATCG CTCCAGCTGC CTTCCCCGGC CAAGGCAGCC
CCCGCTGTAC CGGCGGGTGT CCAGTCCGCC GCCCCGGGGG TCACGCCGTG GATTACCCCC
AACAACGAGT TCTACCGGAT CGACACGGCC CTGAGCGTGC CGGAGATCAA CGCCGAGGAA
TGGGAGCTTC GCGTCCACGG CCTCGTGGAG CAGGAAGTCA CCCTGACGTT CCAGGACCTG
CTCGACGCCG AACTGATTGA GTCCCACGTG ACACTCACGT GCGTGTCGAA TCCCGTGGGC
GGCAACCTGG CGGGCAACGC CAAATGGCTG GGACTGCCCC TCCGTGAGGT CCTCAAGATG
GCCCGCCCCA AGGACGGCGC AGACATGGTG CTGTCCACCT CCGAGGATGG CTTCAGCGCG
TCGACTCCGC TCGAGGTGCT GCAGGATGAC CGCGACGCCA TGCTGGCGAT CGGCATGAAC
GGCGAGCCCC TGCCGCTGGA ACACGGCTAC CCCGTCCGCA TGGTGGTCCC GGGCCTATAC
GGTTTTGTCT CCGCCACCAA ATGGGTGGTG GACCTTGAAG TGACCCGCTT CGCTGACAGC
AAGGCCTACT GGACGGACCG CGGCTGGTCC GAGCGCGGTC CCATCAAGAC CATGGCCCGG
GTGGAGGTGC CCAAGTCCTT CGCGCAGGTC CCGGTCGGCA AAGTGGCCAT CGGCGGCACC
GCCTGGGCGC AGACGCGCGG CATTACCAAG GTGGAGGTGC AGATCGACAA CGGCCCCTGG
ACCGAGGCGG TGCTGTCCAC CGAAGCATCC GTGGTGACGT GGCGCCAGTG GTCGTTCGAA
TGGGACGCCA CCCCCGGCCC GCATTACATC AAGGCCCGGG CCACGGACGG TACCGGCGAG
GTCCAGACGG ACAAGCGCGC CGATCCCGTG CCCGACGGCG CTTCCGGCTG GCAGTCGGTT
ATGGTGACCG TGCAATAG
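
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before measuring length or GC content. A minimal sketch follows; the helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first line of the sequence is used here, so pasting the full 1578 bp sequence would be needed to compare against the GC content listed for the gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as formatted in the record.
first_line = "ATGAAGAAGC TTACGAACTG GCTCAAGGGT CCCACTGCGC TGGCCGCGCT GGCAGGCGTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```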
 
Protein sequence
MKKLTNWLKG PTALAALAGV AAAAVVLSVA ELIGAFFTAR ATPLIALGST FIDFTPPWLK 
DFAIATFGTN DKAALFAGMG LTIFLLACVL GVVAYRKWSL GVAGVLLMGA VIVASVVTRA
SVEPLDAIPS LIGTAAGLVV LRLLITRLWR MRSWPGVAAD VAAKDTERPA TTRRAFFAAS
GITAVAAAIA ATGGRLLSAA RSNIAQARES LQLPSPAKAA PAVPAGVQSA APGVTPWITP
NNEFYRIDTA LSVPEINAEE WELRVHGLVE QEVTLTFQDL LDAELIESHV TLTCVSNPVG
GNLAGNAKWL GLPLREVLKM ARPKDGADMV LSTSEDGFSA STPLEVLQDD RDAMLAIGMN
GEPLPLEHGY PVRMVVPGLY GFVSATKWVV DLEVTRFADS KAYWTDRGWS ERGPIKTMAR
VEVPKSFAQV PVGKVAIGGT AWAQTRGITK VEVQIDNGPW TEAVLSTEAS VVTWRQWSFE
WDATPGPHYI KARATDGTGE VQTDKRADPV PDGASGWQSV MVTVQ
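
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below rebuilds the codon table from the NCBI-style 64-letter amino-acid string, whose assignments are the same for translation table 11 and the standard code (the two differ only in permitted start codons). It then translates the first and last stretches of the gene. The `translate` helper is illustrative, not the database's own method, and it assumes the input starts in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGAAGAAGC TTACGAACTG GCTCAAGGGT"))  # MKKLTNWLKG
print(translate("ATGGTGACCG TGCAATAG"))  # MVTVQ
```

Both fragments match the start (MKKLTNWLKG) and end (MVTVQ) of the protein sequence listed above.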