Gene Arth_3305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3305 
Symbol 
ID4443999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3710115 
End bp3712157 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content68% 
IMG OID639691129 
Productshort chain dehydrogenase 
Protein accessionYP_832781 
Protein GI116671848 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG3347] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02632] rhamnulose-1-phosphate aldolase/alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCA CGAACAAGAC TGTTGAAGAC CTGATTGCCC GGTCCAACCG CCTGGGGGCG 
GACAAGCGGA ACACCAACTT CGCCGGCGGC AACACCTCGG CGAAGGGCGC CGAGAAGGAT
CCGGTCACCG GCGAGGACGT CCAGCTCCTC TGGGTGAAGG GTTCCGGCGG GGACCTGGGA
ACGCTGAAGC CGGAAAACCT TGCCGTGCTC CGCCTGGACC GGCTGAACGC ACTGAAGAAC
GTCTACCCCG GGGTGGACCG CGAAGATGAA ATGGTGGCTG CTTTTGACTA CTGCCTGCAC
GGCAAGGGCG GCGCTGCACC GTCGATCGAT ACCGCCATGC ACGGGCTCGT GGACGCCGCG
CACGTGGACC ACCTGCACCC GGATTCCGGC ATCGCGATTG CCACGGCCGT GGACGGGGAG
TCGCTGACCA CCAAGATCTT CGGTGACAAG GTGGTGTGGG TTCCCTGGCG CCGCCCCGGG
TTCCAGCTGG GCATGGACAT CGCCGCGATC AAGGAAGCCA ACCCGCATGC CGTGGGCACC
ATCCTGGGCG GCCACGGCAT CACCGCCTGG GGCGCCACCA GCGAAGAAGC CGAAGCCAAC
TCGCTGTGGA TCATCGACCA AGCTGAGAAA TTCATCGCGG AAAACGGACG CCCCGAACCC
TTCGGTCCGA AGCTGCCCGG CTACGGGGCC CTGCCGGAGG CAGAGCGCCG CGCCAAGGCT
GCCGCGCTGG CGCCGGTGAT CCGCGGGCTG GCGTCCACGG ACAAACCGCA GCTGGGGCAC
TTCAGCGATG ACGCCGTCGT CCTTGACTTC CTGGAAGCCG CCGAACACCC GCGCCTGGGC
GCGCTGGGCA CGTCCTGCCC GGACCACTTC CTGCGCACCA AGGTCAAGCC GCTGATCCTG
GACCTGCCCG CGGATGCGTC CGTTGAGGAT TCGATCACCC GGCTGCAGGA ACTCCACGCC
GACTACCGCG AGGACTACCA GGCCTACTAC GACCGTCACG CGGTTCCGGA GTCCCCGGCG
CTGCGCGGCG CGGACCCGGC GATCGTGCTG GTGCCGGGTG TGGGCATGTT CTCCTTCGGC
GCGAACAAGC AGACGGCACG CGTGGCCGGT GAGTTCTATC TCAACGCCAT CAACGTGATG
CGCGGCGCCG AGGCGATCTC CACCTACGCC CCGATCGGGG AATCCGAGAA GTTCCGGATC
GAGTACTGGT CCCTCGAGGA AGCCAAGCTC GCCCGCCTGC CCAAGCCCAA ATCCCACGCC
ACCCGCATCG CGCTGGTCAC GGGCGCAGCC TCGGGGATCG GCAAGGCCAT CGCCACCCGC
CTCGCCGCGG AAGGCGCCTG CGTAGTCATC GCCGACCTGA ACCTGGAGAA CGCCCGGGCC
GTCGCCGCGG AACTCGGCGG CCCGGACGTG GCCATCGGCG TGCAGGCGGA TGTCACGGAC
GAAGCCCAGA TCGCCGCCGC CATCCAGGAA GCGGTGCTGG CGTTCGGCGG CGTGGACCTG
GTGGTCAACA ACGCCGGGCT CTCCATCTCC AAGCCGCTGC TGGAAACCAC CGAAAAGGAC
TGGGACCTCC AGCACAACGT CATGGCCAAG GGCTCCTTCC TGGTGTCCAA AGCTGCGGCG
AAGGTCATGA TCGGCCAGGA CATGGGCGGG GACATCATCT ACATCTCCTC GAAGAACTCC
GTCTTCGCCG GCCCGAACAA CATCGCGTAC TCCGCCACCA AGGCCGACCA GGCCCACCAG
GTCCGGCTCC TTGCGGCGGA ATTGGGCGAA TACGGTATCC GCGTCAACGG CATCAACCCC
GACGGCGTGG TCCGCGGCTC CGGGATCTTC GCCGGCGGCT GGGGCGCCAA GCGCGCCGCT
GTCTACGGGG TGGACGAGCA GGAACTGGGC AAGTACTACG CCCAGCGCAC CCTGCTCAAG
CGCGAAGTCC TGCCGGAGAA CGTGGCCAAC GCCGCCGCCG TGCTCACCAG CGCCGAACTC
TCCCACACCA CCGGCCTCCA CATCCCCGTG GACGCCGGCG TGGCCGCCGC CTTCCTGCGG
TGA
 
Protein sequence
MSSTNKTVED LIARSNRLGA DKRNTNFAGG NTSAKGAEKD PVTGEDVQLL WVKGSGGDLG 
TLKPENLAVL RLDRLNALKN VYPGVDREDE MVAAFDYCLH GKGGAAPSID TAMHGLVDAA
HVDHLHPDSG IAIATAVDGE SLTTKIFGDK VVWVPWRRPG FQLGMDIAAI KEANPHAVGT
ILGGHGITAW GATSEEAEAN SLWIIDQAEK FIAENGRPEP FGPKLPGYGA LPEAERRAKA
AALAPVIRGL ASTDKPQLGH FSDDAVVLDF LEAAEHPRLG ALGTSCPDHF LRTKVKPLIL
DLPADASVED SITRLQELHA DYREDYQAYY DRHAVPESPA LRGADPAIVL VPGVGMFSFG
ANKQTARVAG EFYLNAINVM RGAEAISTYA PIGESEKFRI EYWSLEEAKL ARLPKPKSHA
TRIALVTGAA SGIGKAIATR LAAEGACVVI ADLNLENARA VAAELGGPDV AIGVQADVTD
EAQIAAAIQE AVLAFGGVDL VVNNAGLSIS KPLLETTEKD WDLQHNVMAK GSFLVSKAAA
KVMIGQDMGG DIIYISSKNS VFAGPNNIAY SATKADQAHQ VRLLAAELGE YGIRVNGINP
DGVVRGSGIF AGGWGAKRAA VYGVDEQELG KYYAQRTLLK REVLPENVAN AAAVLTSAEL
SHTTGLHIPV DAGVAAAFLR