Gene Arth_2623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2623 
Symbol 
ID4444864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2940010 
End bp2941326 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content66% 
IMG OID639690442 
Producthomoserine dehydrogenase 
Protein accessionYP_832102 
Protein GI116671169 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.760632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAT TGCGAACCCT GAAGGTAGCC CTGCTGGGCT GTGGCAACGT TGGGGCCCAG 
GTGGCGCGGA TTCTCATTGA TGACGCTGAC GCACTGGCCG CACGCACCGG AGCCCGCCTG
GAGCTGACCG GCATCGCCGT ACGCACCATC GATGCACCCC GTGACGTTGA GCTGCCGCGG
GAACTCTTCA CTACGGACGC AGACACCTTG GTCAAGGACG CGGACCTCGT GATCGAACTG
ATGGGCGGCA TCGAACCTGC CCGTTCCCTG ATTCTCGCCG CCATCCAGAA CGGGGCCTGC
GTGGTCACCG GCAACAAGGC CCTCCTGGCC CAGGACGGCC CCACGCTCTA TGAAGCTGCG
GACAAAGCAG GCGTCCAGCT GTCCTACGAA GCAGCGGTGG CCGGCGCCAT CCCCATCCTG
CGCCCCATCC GCGACAGCCT CTCCGGTGAC CGTATTACCC GGGTGCTGGG CATCGTGAAC
GGCACCACCA ACTTCATCCT GGACCAGATG GACACCACGG GCGCGACGTT CGCGGACGCC
CTGGCCGAGG CGCAGCGCCT GGGCTACGCG GAGGCGGACC CCACCGCCGA TGTGGGCGGA
CTCGACGCCG CGGCCAAGGC AGCCATCCTC GCCTCGCTGT CCTTCCATAC CCGTTTTGAC
CTCGAAAACG TCCATTGCGA AGGCATCACC GGCGTCAGCG CGGCGGACAT CGCCGCGGCC
AAGGACGCCG GGTTCGTCAT CAAGCTGCTG GCCATTGCTG AGAAGCTGAC CGCCGCAGAC
GGTAGCGAGG GCGTGTCTGT CCGAGTGCAC CCCACCCTGC TGCCGCGCGA GCACCCGCTG
GCCGCCGTCC ACGGTGCCTT TAATGCCGTC TTCGTTGAGG CCGAGAATGC CGGCGAGCTG
ATGTTCTACG GCCAGGGCGC CGGCGGTACT CCCACGGCAT CGGCCGTACT GGGCGACCTC
GTCTCCGCCG CACGCCGGAT TGTCCTGGGC GGCCCTGCGC AGACCGAGAC CACCATTGGC
AAGGTCCCCG CCCTGCCGAT TGACGCCGTC AACACCAGCT ACTACATCGG CCTTGACGTT
GCCGACCAGC CGGGTGTGCT GGCAAAGATC GCCCAACTGT TCGCGGAGCA CGGCGTGTCC
ATCGAGATCA TGCGCCAGAC CATCCACCGC GACGCCGACT CCAACGTGGA ATCGGCCGAA
CTGCGGATCG TCACCCACCG CGCAACCGAA GCTGCACTGG CAGCAACCGT CCAGGCCGTG
AAGGGCCTCG ACGTCATCAA TTCCGTTACA TCCGTACTGC GGGTAGAAGG AGTCTAA
 
Protein sequence
MTELRTLKVA LLGCGNVGAQ VARILIDDAD ALAARTGARL ELTGIAVRTI DAPRDVELPR 
ELFTTDADTL VKDADLVIEL MGGIEPARSL ILAAIQNGAC VVTGNKALLA QDGPTLYEAA
DKAGVQLSYE AAVAGAIPIL RPIRDSLSGD RITRVLGIVN GTTNFILDQM DTTGATFADA
LAEAQRLGYA EADPTADVGG LDAAAKAAIL ASLSFHTRFD LENVHCEGIT GVSAADIAAA
KDAGFVIKLL AIAEKLTAAD GSEGVSVRVH PTLLPREHPL AAVHGAFNAV FVEAENAGEL
MFYGQGAGGT PTASAVLGDL VSAARRIVLG GPAQTETTIG KVPALPIDAV NTSYYIGLDV
ADQPGVLAKI AQLFAEHGVS IEIMRQTIHR DADSNVESAE LRIVTHRATE AALAATVQAV
KGLDVINSVT SVLRVEGV