Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3850 |
Symbol | |
ID | 4447549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4331785 |
End bp | 4332723 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639691674 |
Product | 4-amino-4-deoxychorismate lyase |
Protein accession | YP_833325 |
Protein GI | 116672392 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCTG CTGCCCCGGC CACCGTTCTC GTATTCCTTG ATCCCCGGTT CGACGACGGC CGGATTGCCG ATGCTTCGCA GCCGCAGCTG ATGGCCACGG ACCAAGGTGC CACGCGCGGC GACGGCGTGT TTGAGTCCAT GCTGGCCGTG GGCGGAAACC CGCGGAAGCT GGACGCCCAC CTGCGGCGGC TGCAGGGTTC GGCCCGCGCG CTGGAGCTGG ACATCCCGGG CGAGGACACC TGGCGCCGGG CCATAGCGAC GGCGGTGGCC GAATACCGTT CCCAGCATCC CGCCGGCACA CCGGAGGAAG ACGAGACGGT GGTGAAGCTG ATCTGCACCC GCGGCGCCGA GGGCGGAGCA CGCCCCACCT GCTGGGTCCA GGCTTCCCCC GTCCCCGCCG CCGGCCGGCG CCAGCGCGAG ACGGGGATAG ACGTCATCCT CCTGGACCGC GGCTACGACA GTGAAGTGGG TGAACGGGCG CCGTGGCTGC TGCTGGGCGC CAAGACGCTC TCCTACGCGG TGAACATGGC GGCCCTGCGG TACGCCCACA ACCAGGGGGC GGACGACGTC ATCTTCACCT CGTCCGACGG ACGGGTCCTC GAAGGCCCCA CGTCCACCGT CCTGCTGGCG CACCTTGACA CAGTCGACGA CGGCGGCACC CGCACGGTGC GCCGCCTCAT CACGCCGCAG CTGGACAGCG GCATCCTCCC GGGAACGTCT CAGGGCGCGC TCTTTGCTGC CGCCAAAGCT GCGGGCTGGG AACTCGGCTA CGGCCCGCTG GAGCCGAGGG ACCTTTTCGA CGCCGACGCC GTGTGGCTGA TTTCCAGCAT CCGGCTGCTG GCTCCCGTGA ACCATATCGA CGGCAAGGAA ATCGGCACGC CCGCCCTCCG GAAGCAGCTG ACCGACGAGC TCAACCAGCT GTTCGCCACG ATCGAATAG
|
Protein sequence | MTPAAPATVL VFLDPRFDDG RIADASQPQL MATDQGATRG DGVFESMLAV GGNPRKLDAH LRRLQGSARA LELDIPGEDT WRRAIATAVA EYRSQHPAGT PEEDETVVKL ICTRGAEGGA RPTCWVQASP VPAAGRRQRE TGIDVILLDR GYDSEVGERA PWLLLGAKTL SYAVNMAALR YAHNQGADDV IFTSSDGRVL EGPTSTVLLA HLDTVDDGGT RTVRRLITPQ LDSGILPGTS QGALFAAAKA AGWELGYGPL EPRDLFDADA VWLISSIRLL APVNHIDGKE IGTPALRKQL TDELNQLFAT IE
|
| |