Gene Arth_3951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3951 
Symbol 
ID4447769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4464695 
End bp4465804 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content68% 
IMG OID639691782 
Productnitrate reductase (NADH) 
Protein accessionYP_833426 
Protein GI116672493 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAGC AGATTTCCCG GCGCCGTCAG TCCGTGAAGC CCGCGGCGCA CTCAGGAGAG 
CCCACCCACG GTCCGCTCAC CGCAGAGGAA CTGCAGCTGG CCGTCAGGAA CCATTCGATG
CCATTGGAGG CCTTGCGGGA GGCCACCACA CCGCCGGGGC TGCACTACGT CCTGACGCAC
TTCGACATCC CGTTCATTGA CGCCGACTCC TGGCACCTGC GGATCGGCGG TGCCGTGCAG
CGCGCCGTCG AGATCAACCT CCGGGCGCTT CGCCGGGACC CGACCATCAG CATTCCGGTC
ACGCTGGAGT GCGCCGGCAA CGGCCGCTCG CTGCTGCATC CGCGGCCGCT GAGCCAGCCG
TGGCGGCTCG AGGGCGTGGG AACGGCGGAG TGGACCGGGG TTCCGCTCGC GTACCTGCTG
GCCCAGGCCG GCGTTGACGA GGACGCCGTC GAAGTGGTGT TCACCGGCGC CGACGCCGGC
ATCCAGGGCG GAGTCCGGCA GACGTATGCG CGCAGCCTTC CGATCAAGGA GGCGATGCGC
CCCGATGTCG TCCTGGCGTA TGAGATGAAC GGTCGCGAGC TGCCGCCGCA GCACGGCTAC
CCCCTGCGCC TTGTGGTCCC TGGCTGGTAC GGCATGGCCA GCGTGAAGTG GCTGGAGTCC
ATTCAGGTGC TGACCCATCC GTTCGAGGGA TTCCAGCAGT CGGTGGCGTA CCGCTACCAG
AAGGACGCGG ACGACGCCGG CACTCCGGTC TCCCGGATCA AGGTGCGTTC GCTGATGATT
CCGCCGGGCA TCCCGGACTT CTTCACCCGC AGCAGGGTCC TCTCCGCCGG CCCGGTCATG
CTTACGGGCA GGGCCTGGTC CGGTGAAGGC TCCGTGGTCC GCGTGGAAGT GGGGATTGAC
GGGAAATGGG TGACCGCGCA CCTCGGACAC CCGGCGGGGC CGTTTGCCTG GTGCGAATGG
ACGCTGCCGT GGGTGGCGGA CCGGGGCGAG CATGAGCTCG CTTGCCGGGC CACCGACGCC
ACGGGATCAA CGCAGCCGCT GGAGCAGGTC TGGAACTACC AGGGCATGGG CAACAACGTG
GTGCAGCGCG TGAAGGTGAG CGTCGAGTAG
 
Protein sequence
MTKQISRRRQ SVKPAAHSGE PTHGPLTAEE LQLAVRNHSM PLEALREATT PPGLHYVLTH 
FDIPFIDADS WHLRIGGAVQ RAVEINLRAL RRDPTISIPV TLECAGNGRS LLHPRPLSQP
WRLEGVGTAE WTGVPLAYLL AQAGVDEDAV EVVFTGADAG IQGGVRQTYA RSLPIKEAMR
PDVVLAYEMN GRELPPQHGY PLRLVVPGWY GMASVKWLES IQVLTHPFEG FQQSVAYRYQ
KDADDAGTPV SRIKVRSLMI PPGIPDFFTR SRVLSAGPVM LTGRAWSGEG SVVRVEVGID
GKWVTAHLGH PAGPFAWCEW TLPWVADRGE HELACRATDA TGSTQPLEQV WNYQGMGNNV
VQRVKVSVE