Gene Arth_3560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3560 
Symbol 
ID4443871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4000689 
End bp4001738 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content67% 
IMG OID639691384 
Producthomoserine dehydrogenase 
Protein accessionYP_833035 
Protein GI116672102 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCT ATGACCTTGC ACTGATCGGC TTCGGCGGAG TGAACCGCAG CCTGGCTGAG 
CTGATCGCCG CCCGCGGGGA AGCACTCCGC GCCGAGGTGG GCTTCGGCCT GCGTGTAGTT
GCCATCACGG ATCTGCGCCT CGGGTCCCTC GTGGACGCCG GCGGTATTGA TCTGTCCATG
GCCCTGGCGC TGGGCGGCGA CGGCGAAACC TTTGCCCGGC ACGGCGGTTC CGCGGAGCCG
GATAACGAGG GCGTGATCCG CAACTGCGCA GCGGACATCA TCTGCGAGGC CACCTTCACC
AATCCCGAGG ACGGCGAGCC CGCGGTGTCG CACGTCCGCT GGGCACTGGA GTCGGGCAAG
AGCGTCTGCA CCACCAACAA GGGACCCGTG GCGCTTCGGG GCCGGGAACT GGCAGCCCTG
GCGGAACGGC AAGGTGTCCG CTTCGAGTTT GAAGGTGCCG TCATGAGCGG CACTCCGGTG
ATCCGGCTTG CCAAGCAGAT GTTCGGCGGC CTGCAGCTCA ACGGCTTCGA AGGGATCATG
AACGGCACCA GCAACTACGT TCTGGGGCGC ATGGAAGCCG GCTTTGAACT TGCTGAGGCT
GTCCGGGAGG CCCAGGAGCT GGGTTACGCA GAGGCGAATC CCGCCGCGGA CCTCGAGGGT
TTCGACGTGC AGCTGAAGGT GCTGATCCTC GCCAACGAAT TGCTTGGCGG GAACCTGGAA
CTGAAGGACG TCCGCCGGGA GGGAATCTCC GCGCTGACGC CCGGGGATAT CCGGGCAGCC
GCTTCGTCCG GCCGGCGCTG GAAGCTGATC GGATCCGCCA GGCGGACTCC CGACGGCGGC
ATCGCGGCAA GCGTTGCACC GCGCGCCGTT GATGTTGCGC ACAGCCTTGC CGGCATTTCG
GGCGCCACGA ACGCGGTTTC GTTCGAGACC GACCTGCTGG GCCCCGTGAC GGTTTCCGGC
CCCGGCGCCG GACGCATCGA GACGGCCTAC GCGCTGCTCT CCGACATCAT GGCCATCCAC
AAGATGTCGG AAGGACTGGC ACGTGTCTGA
 
Protein sequence
MTTYDLALIG FGGVNRSLAE LIAARGEALR AEVGFGLRVV AITDLRLGSL VDAGGIDLSM 
ALALGGDGET FARHGGSAEP DNEGVIRNCA ADIICEATFT NPEDGEPAVS HVRWALESGK
SVCTTNKGPV ALRGRELAAL AERQGVRFEF EGAVMSGTPV IRLAKQMFGG LQLNGFEGIM
NGTSNYVLGR MEAGFELAEA VREAQELGYA EANPAADLEG FDVQLKVLIL ANELLGGNLE
LKDVRREGIS ALTPGDIRAA ASSGRRWKLI GSARRTPDGG IAASVAPRAV DVAHSLAGIS
GATNAVSFET DLLGPVTVSG PGAGRIETAY ALLSDIMAIH KMSEGLARV