Gene Arth_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2139 
Symbol 
ID4445216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2409657 
End bp2411108 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content64% 
IMG OID639689947 
Productputative short chain dehydrogenase 
Protein accessionYP_831619 
Protein GI116670686 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.538588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCCT CCGATTTGAC GCCTGAGGAC ATCCAGGCCT GCCTCAAGGT TCTTAACACC 
ATCCACGCCT ATGACGAGGA GCACCCGGAC TACGTCTCGG TTCGACGCGC CACGGGCAAG
ATGTTCAAGG CTGTCAAACG CCACCGCCGG GTCACCAAGC GCGACCTGAT CGCAGAGTCC
GATCGCGCAG TCATCGCCCA GACGGCTACG GCAGCGCCGG ACCGGATCGA TGACGAAACC
CGCGGGAACA AGCTGGAACC CTCTGCGACC GGCAAGGTGG CCGGACACCT CATCAGGTCC
CGCCCGTGCT ACATCTGCAA GAATCACTAC ACGCAGGTTG ATGCCTTCTA TCACCAGTTG
TGCCCTGAGT GCGCTGCGTT CAGCCACAGC AAGCGCGACG CGCGGACGGA CCTCACCGGC
CGGCGTGCCC TCCTTACGGG AGGTCGCGCC AAAATCGGCA TGTACATCGC CCTGCGGCTG
CTGCGGGACG GTGCCCACAC CACCATCACC ACCCGGTTCC CGAAAGATGC GGCCCGACGC
TTCGCCGCGA TGGAGGACAG CGGTGAGTGG CTCCATCGGC TCAGGATCGT GGGCATCGAC
CTTCGTGATC CCTCCCAGGT AATGGCCCTG ACGGATTCCC TCGACGCCGC GGGCCCGCTG
GACATCATCA TCAACAATGC GGCCCAGACG GTCCGCCGCT CCGGCAACGC CTACAAGCCG
CTGGTCGATG CAGAGGACGA GCCCCTGCCG GCCGCCCTCG ACGCTGCCAA CGGCGGACCG
GAACTGGTGA CCTTCGGCCA CGCCCACGAC AAGCACCCGT TGGCCCTTGC CAGCAGCGTC
ATGGAACACC CGGTCCTGGC CGGCGACGCC ATCACATCCC TGGCACTCTC TACGGGTTCG
GCTTCGCTGG AACGGATAGC CACCGGCACG GCCATCGACG CCGGCGGGCT GGTTCCTGAC
CTGGCCACCA TCAACAGCTG GACGCAGGTG GTGGATGAAG TGGACCCGCT GGAGATGCTC
GAAGTTCAGC TCTGCAACGT GACGGCGCCC TTCCTGCTCG TGAGCCGTCT GCGTGCCGCC
ATGAAGCGCT CCACCGCGCA CCGGAAGTAC ATCGTGAACG TTTCCGCCAT GGAAGGGCAG
TTCTCACGCG CATACAAGGG TCCGGGCCAC CCCCATACCA ACATGGCCAA AGCGGCGCTA
AACATGATGA CCCGCACCAG CGCGCAGGAA ATGCTCGATT CCGACGGCAT CCTGATGACC
GCCGTGGACA CCGGATGGAT CACTGATGAG CGTCCGCATT ACACCAAGGT CAGGCTCATG
GAGGAAGGCT TCCATGCTCC GCTGGACCTC GTGGACGGTG CAGCGAGGGT CTACGATCCG
ATTGTCATGG GAGAAAACGG CGAAGACCAG TACGGCGTCT TCCTCAAGGA CTACAAGCCC
AGCCCCTGGT AG
 
Protein sequence
MNSSDLTPED IQACLKVLNT IHAYDEEHPD YVSVRRATGK MFKAVKRHRR VTKRDLIAES 
DRAVIAQTAT AAPDRIDDET RGNKLEPSAT GKVAGHLIRS RPCYICKNHY TQVDAFYHQL
CPECAAFSHS KRDARTDLTG RRALLTGGRA KIGMYIALRL LRDGAHTTIT TRFPKDAARR
FAAMEDSGEW LHRLRIVGID LRDPSQVMAL TDSLDAAGPL DIIINNAAQT VRRSGNAYKP
LVDAEDEPLP AALDAANGGP ELVTFGHAHD KHPLALASSV MEHPVLAGDA ITSLALSTGS
ASLERIATGT AIDAGGLVPD LATINSWTQV VDEVDPLEML EVQLCNVTAP FLLVSRLRAA
MKRSTAHRKY IVNVSAMEGQ FSRAYKGPGH PHTNMAKAAL NMMTRTSAQE MLDSDGILMT
AVDTGWITDE RPHYTKVRLM EEGFHAPLDL VDGAARVYDP IVMGENGEDQ YGVFLKDYKP
SPW