Gene Arth_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1950 
Symbol 
ID4445534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2200315 
End bp2201457 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content64% 
IMG OID639689760 
Productoxidoreductase domain-containing protein 
Protein accessionYP_831432 
Protein GI116670499 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTGC CTTCTGCGGA ATCGGTCCGG ACCATTCGTT ACGGCCTTAT CGGTGCCGGC 
CACATGGCCC GCGAACACGT CCGGAACCTT GCCCTGATCC CGGGAAGCCA CATCACCGCC
GTATCCGATC CCCAGCCGTC ATCGCTGGCG GAGACCGTTG CGGAAATCGG CTACGAGGTA
CAGACCTTCT CCCGTCACCA GGACCTGCTC GCGTCCGGAC TGGTGGATGC ATTGGTGATC
GCCAGCCCCA ACGACACGCA CCTGGCCATC CTCAAGGACA TCTTCGCGAG TGGCACCAAC
CTGCCCGTGC TGGTGGAGAA GCCCGTGTGC ACCACTGCGG AACAGGCCGA CGAGCTTGAA
GCGCTGGCAG CTGACTACAC CGCGCCGGTG TGGGTTGCCA TGGAGTACCG CTACATGCCG
CCGGTGCAGG AAATCATCCA GGCGGCCCAC GGCGGCAGGC TCGGCAACGT GTACATGCTA
TCCATCGTGG AGCACCGCTT CCCGTTCCTG CACAAGGTGG ACGCCTGGAA TCGCTTCACG
GAGCGGACCG GAGGCACGCT GGTGGAAAAG TGCTGCCACT TCTTCGACCT GATGCGGCTG
ATCCTGCAGG ACGAACCCGT GCGCGTCTAC GCCAGCGGCG GCCACGACGT CAACCACATG
GACGAGGTGT ATGACGGCAG GGTGTCAGAC ATGATCGACA ACGCCTACGT GATTGTGGAC
TTCAAGGGCG GGCGCCGGGC CATGCTGGAG CTGTCCATGT TCGCGGAGGG CTCCAAGTTC
CAGGAGCGGA TCTCCATTGT GGGCGACGCC GCCAAGATCG AGACCCTCAT CCCGGTGGCG
GCCAACCACT GGATCGAGGG CGACGAGGCC GAGGCGACGG TGGAATTCAG CCCGCGCTCG
CCGCTGGGGC CGGAAATGCA CGAGGTTCCT GTAGATGAGG CCGTCCTCGC TGCCGGCGCC
CACCACGGCT CCACGTACTA TGAACACCTT GGCTACCGCA AAGCCATCCT GGGTGACGGG
CCGGTGGAAG TTACGGTTGC CGACGGCCTG CAGTCCGTGC GCATGGGCTT GGCGGCCGAG
CGCTCCATCA TCGAAGGACG CCCCGTAGAG CTGACGAATG CCGGTGCCGG GCTCAGTCAC
TGA
 
Protein sequence
MSLPSAESVR TIRYGLIGAG HMAREHVRNL ALIPGSHITA VSDPQPSSLA ETVAEIGYEV 
QTFSRHQDLL ASGLVDALVI ASPNDTHLAI LKDIFASGTN LPVLVEKPVC TTAEQADELE
ALAADYTAPV WVAMEYRYMP PVQEIIQAAH GGRLGNVYML SIVEHRFPFL HKVDAWNRFT
ERTGGTLVEK CCHFFDLMRL ILQDEPVRVY ASGGHDVNHM DEVYDGRVSD MIDNAYVIVD
FKGGRRAMLE LSMFAEGSKF QERISIVGDA AKIETLIPVA ANHWIEGDEA EATVEFSPRS
PLGPEMHEVP VDEAVLAAGA HHGSTYYEHL GYRKAILGDG PVEVTVADGL QSVRMGLAAE
RSIIEGRPVE LTNAGAGLSH