Gene Arth_1778 details

Gene Information

Locus tag: Arth_1778
Symbol:
ID: 4445677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1989623
End bp: 1991065
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 65%
IMG OID: 639689596
Product: mercuric reductase
Protein accession: YP_831268
Protein GI: 116670335
COG category: [C] Energy production and conversion
COG ID: [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
TIGRFAM ID: [TIGR02053] mercuric reductase
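
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1989623, 1991065)
print(length)                     # 1443
print(protein_length_aa(length))  # 480
```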


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.3364
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCGGCAG CATCTTTCGA TTATGACCTG GCCATTATCG GTTCCGGCGG GGCTGCTTTC 
GCTGCGGCCA TCCGGGCAAC CAGCCGTGGC AAGCGGGTGT TGATGGTGGA GCGCAGCACT
GTGGGAGGCA CGTGCGTGAA CACGGGCTGC ATCCCGTCCA AGGCCCTGCT GGCCGCCGCG
GAAGCCCGCC ATGTCGCCCT CGATGCTTCC GGACGGTTCC CCGGTATCAG CACCTCCGCA
GAGCCGGTGG ATATGCCCGA ACTGGTCGCC GGGAAGCGCT CACTGGTCGA ATCCATGCGG
TCAGAGAAGT ATGTGGATCT CGCCGCGGGC TATGGATGGA ACCTGCAGCG GGGGACGGCG
GTGTTCGCCG GAACCGCAGC CGCACCGGTT TTGAACATCA CCGCCCCGGG CGGAACCACC
GAGACAGTCA GCGCGGAACA CTACCTGGTC GCGACCGGCT CCACCCCCTG GATCCCTGAA
GTGCCGGGAA TGGACGAGGT GGATTATCTG ACGTCCACGA GTGCGATGGA GCTGCAGGAC
GTTCCCGCTT CGATGCTGGT GGTGGGCGGC GGGTATGTGG CGCTGGAGCA GGCGCAGCTT
TTCGCCCGGC TCGGCACGGA GGTGACCATC CTGGTCCGGT CCAAGCTCGC CTCGGCCGAA
GAGCCTGAAG CCGGGCATGC CCTCGCCGGT GTCTTCGCCG ATGAGGGTAT CCGGGTCGTC
CGTCGAGCGA CAGCGTCCTC GGTCCGGACC GATGAGGTGT CGGGGGACGT GGTCGTGGAT
GCTTCCGTCT CAGGAGGAAA CGAGGAATTC AGGGCCGCGC GCCTGCTCAT GGCAACAGGC
CGGCGCCCGG TCACGGAGGA TTTGAACCTT TGCATGGTCG GCGTTGAAAC CGGGGACCGC
GGGGAAGTCC TGGTCGACGG GAGCCTTCGC AGTACTAATC CGAGGATCTG GGCCGCGGGT
GATGTGACGG GTCACCCGGA GTTCGTTTAT GTCGCCGCCG CGCACGGGGC CCTGATGGTG
GAGAACGCCT TTGAGGGTGC CGGGCGTGAG GTCGATTACC GGCACCTGCC CCGGGTCACG
TTTACCAGCC CTGCCCTGGC CGCTGTCGGG ATGACGGACA AGGAAGCGAA CCAGGCAGGG
ATCCGGTGCA TGTGCCGGGT TCTGCCGCTC AAATTTATCC CTCGCGCGCT GGTGAACCGT
GATACCCGCG GCTTCATCAA GATCGTTGCC GACGCGGACA CGGGTCGGAT TGTAGGGATC
ACTGTCGTGG GTAAGGACGC CGGGGACATC GCCGCCGCAG GGATTTACAT TCTGGAGGCC
GGGATGACCG TTGATCAGGT CGCGAATCTC TGGAGCCCCT ATCTGACCAT GGCCGAAGGC
ATCAAGATAG CAGCCCAGTC CTTCACTACT GACGTCTCCA AACTGTCCTG TTGCGCGGCA
TGA
 
Protein sequence
MSAASFDYDL AIIGSGGAAF AAAIRATSRG KRVLMVERST VGGTCVNTGC IPSKALLAAA 
EARHVALDAS GRFPGISTSA EPVDMPELVA GKRSLVESMR SEKYVDLAAG YGWNLQRGTA
VFAGTAAAPV LNITAPGGTT ETVSAEHYLV ATGSTPWIPE VPGMDEVDYL TSTSAMELQD
VPASMLVVGG GYVALEQAQL FARLGTEVTI LVRSKLASAE EPEAGHALAG VFADEGIRVV
RRATASSVRT DEVSGDVVVD ASVSGGNEEF RAARLLMATG RRPVTEDLNL CMVGVETGDR
GEVLVDGSLR STNPRIWAAG DVTGHPEFVY VAAAHGALMV ENAFEGAGRE VDYRHLPRVT
FTSPALAAVG MTDKEANQAG IRCMCRVLPL KFIPRALVNR DTRGFIKIVA DADTGRIVGI
TVVGKDAGDI AAAGIYILEA GMTVDQVANL WSPYLTMAEG IKIAAQSFTT DVSKLSCCAA
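
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it assumes translation table 11 uses the standard codon assignments, that the first codon (here GTG) is read as methionine because it acts as the initiator, and that the spaces between 10-base groups are purely cosmetic. The names `CODON_TABLE`, `clean`, and `translate_cds` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete or partial CDS that starts at its initiator codon."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence must be a non-empty multiple of 3 bases")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # Assumption: the first codon is the initiator and is read as Met,
    # even when it is an alternative start such as GTG.
    residues[0] = "M"
    # Drop the trailing stop symbol, if present.
    return "".join(residues).rstrip("*")


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGTCGGCAG CATCTTTCGA TTATGACCTG"))  # MSAASFDYDL
```

Pasting the full gene sequence into `translate_cds` should give a 480-residue string that can be compared with the protein sequence once its spaces are removed with `clean`.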