Gene Information |
Locus tag | Arth_2623 |
Symbol | |
ID | 4444864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2940010 |
End bp | 2941326 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690442 |
Product | homoserine dehydrogenase |
Protein accession | YP_832102 |
Protein GI | 116671169 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
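The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2940010, 2941326)
print(length)                     # 1317
print(protein_length_aa(length))  # 438
```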
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.760632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAT TGCGAACCCT GAAGGTAGCC CTGCTGGGCT GTGGCAACGT TGGGGCCCAG GTGGCGCGGA TTCTCATTGA TGACGCTGAC GCACTGGCCG CACGCACCGG AGCCCGCCTG GAGCTGACCG GCATCGCCGT ACGCACCATC GATGCACCCC GTGACGTTGA GCTGCCGCGG GAACTCTTCA CTACGGACGC AGACACCTTG GTCAAGGACG CGGACCTCGT GATCGAACTG ATGGGCGGCA TCGAACCTGC CCGTTCCCTG ATTCTCGCCG CCATCCAGAA CGGGGCCTGC GTGGTCACCG GCAACAAGGC CCTCCTGGCC CAGGACGGCC CCACGCTCTA TGAAGCTGCG GACAAAGCAG GCGTCCAGCT GTCCTACGAA GCAGCGGTGG CCGGCGCCAT CCCCATCCTG CGCCCCATCC GCGACAGCCT CTCCGGTGAC CGTATTACCC GGGTGCTGGG CATCGTGAAC GGCACCACCA ACTTCATCCT GGACCAGATG GACACCACGG GCGCGACGTT CGCGGACGCC CTGGCCGAGG CGCAGCGCCT GGGCTACGCG GAGGCGGACC CCACCGCCGA TGTGGGCGGA CTCGACGCCG CGGCCAAGGC AGCCATCCTC GCCTCGCTGT CCTTCCATAC CCGTTTTGAC CTCGAAAACG TCCATTGCGA AGGCATCACC GGCGTCAGCG CGGCGGACAT CGCCGCGGCC AAGGACGCCG GGTTCGTCAT CAAGCTGCTG GCCATTGCTG AGAAGCTGAC CGCCGCAGAC GGTAGCGAGG GCGTGTCTGT CCGAGTGCAC CCCACCCTGC TGCCGCGCGA GCACCCGCTG GCCGCCGTCC ACGGTGCCTT TAATGCCGTC TTCGTTGAGG CCGAGAATGC CGGCGAGCTG ATGTTCTACG GCCAGGGCGC CGGCGGTACT CCCACGGCAT CGGCCGTACT GGGCGACCTC GTCTCCGCCG CACGCCGGAT TGTCCTGGGC GGCCCTGCGC AGACCGAGAC CACCATTGGC AAGGTCCCCG CCCTGCCGAT TGACGCCGTC AACACCAGCT ACTACATCGG CCTTGACGTT GCCGACCAGC CGGGTGTGCT GGCAAAGATC GCCCAACTGT TCGCGGAGCA CGGCGTGTCC ATCGAGATCA TGCGCCAGAC CATCCACCGC GACGCCGACT CCAACGTGGA ATCGGCCGAA CTGCGGATCG TCACCCACCG CGCAACCGAA GCTGCACTGG CAGCAACCGT CCAGGCCGTG AAGGGCCTCG ACGTCATCAA TTCCGTTACA TCCGTACTGC GGGTAGAAGG AGTCTAA
|
Protein sequence | MTELRTLKVA LLGCGNVGAQ VARILIDDAD ALAARTGARL ELTGIAVRTI DAPRDVELPR ELFTTDADTL VKDADLVIEL MGGIEPARSL ILAAIQNGAC VVTGNKALLA QDGPTLYEAA DKAGVQLSYE AAVAGAIPIL RPIRDSLSGD RITRVLGIVN GTTNFILDQM DTTGATFADA LAEAQRLGYA EADPTADVGG LDAAAKAAIL ASLSFHTRFD LENVHCEGIT GVSAADIAAA KDAGFVIKLL AIAEKLTAAD GSEGVSVRVH PTLLPREHPL AAVHGAFNAV FVEAENAGEL MFYGQGAGGT PTASAVLGDL VSAARRIVLG GPAQTETTIG KVPALPIDAV NTSYYIGLDV ADQPGVLAKI AQLFAEHGVS IEIMRQTIHR DADSNVESAE LRIVTHRATE AALAATVQAV KGLDVINSVT SVLRVEGV
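The protein sequence can be derived from the gene sequence with the codon assignments of translation table 11. The following is a minimal sketch, not the database's own method: it builds the codon table from the NCBI-style amino acid string, strips the spacing used in the record, and stops at the first stop codon. It does not handle alternative start codons (which table 11 permits), which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for table 11, in NCBI order with bases cycling as T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaced as in the record.
print(translate("ATGACTGAAT TGCGAACCCT GAAGGTAGCC"))  # MTELRTLKVA
```

Because the gene lies on the minus strand, the sequence shown in the record is presumably already given in coding orientation, so no reverse complement is applied in this sketch.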