Gene Information |
Locus tag | Arth_0822 |
Symbol | |
ID | 4446660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 888827 |
End bp | 890791 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688629 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_830320 |
Protein GI | 116669387 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
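The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon (the gene sequence in this record ends in TAA, a stop codon under translation table 11). The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon gives the number of amino acids.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(888827, 890791)
print(gene_len, protein_len)  # 1965 654
```

Under these assumptions the computed values agree with the Gene Length (1965 bp) and Protein Length (654 aa) fields.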
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.837061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGGACGA GAACTGTAAC CGCAGGAACC CGCCGGATGA CGGTCGCCCA GGCCGTCGTC GAATATCTAT CCAAGCAGTA CACCGTTGAT TCTGTTGGCG GGCTGGATTA CCGCGAACGC CTGATCCCCG GCACGTTCGG CATCTTCGGG CACGGCAACG TTGCCGGAGT GGGCCAGGCG CTCAAGCAGT ACCAGCAGCT GGATCCGGCG ATCATGCCGT ACTACCAGGG CCGCAACGAG CAGGCGCAGG CGCACCAGGC TGTGGGCTAC GCGCGCCACA CCCGCCGCCG GCAGACCTTT GCGATCAGCA CCTCGATCGG GCCGGGTTCC TCCAACCTGC TCACGGGGGC GGCGCTGGCC ACCACCAACC GCCTGCCGGT CCTGCTGCTT CCCAGCGACA CGTTCGCCAC CCGTGCTGCG GATCCGGTGC TGCAGCAGCT CGAACAGCCC TACGCGTACG ACATCACCGT CAATGACGCC TTCCGTCCGC TGTCCAAGTT CTTCGACCGC GTCAACCGGC CCGAGCAGCT GTTCTCCGCG TTCCACCACG GACTGCGCGT ACTGACGGAT CCGGCCGAAA CCGGCGCTGT CACCATCTCC CTGCCGCAGG ATGTCCAGGC CGAAGCCTTA GACGTGCCCG AGGAGTTCCT GGCCGAACGC GAGTGGCGGA TCCGCCGCCC GGACGCGGAC GACGAGGACA TCCGCCGTGC CGCCGAAGCC ATCCGCGCCG CGAAACGCCC GCTGATCATC GCCGGCGGCG GCGTCCTCTA CGCCTACGCC AACGACGAAC TGGCCAGGTT CGTGGAACTC ACCGGCATCC CGGTGGGCAA CACCCAGGCC GGGGTGGGCG TCCTGCCCTG GGACCACAAG TTGTCCCTCG GCGCGATCGG CTCCACGGGC ACGACGGCGG CAAATGCCCT TGCAGCCGAA GCGGACCTGA TCATCGGCAT CGGCACCCGC TACGAGGACT TCACCACCGC CTCGCGGACC GCGTTCCAGA ACCCGGATGT GAAGTTCATC AACATCAACG TAGCCGCCAT GGATGCGTAT AAGCACGGCA CGTCGCTGCC GATCGTCGCG GATGCGCGCA AGGCACTGGT GAAGCTGAAC CAAGCACTCG GCGGCTACCG CGTGGGCGCC GACCTGGAAC AGCAGATCGC GGCCGAGAAG AAGCGCTGGG ATGCCACCGT GGACGAAGCG TTCGACACCC GCTTCACTCC GCTGCCGGCC CAGAACGAAA TCATCGGCGC CACGTCCCGG GCCATGGACG CCCAGGACGT CGTCGTCTGC GCCGCCGGGT CCCTGCCCGG TGATCTGCAC AAGATGTGGC GCGTCCGGGA CCCCTTCGGC TACCACGTGG AATACGCCTA CTCCTGCATG GGCTACGAAA TCCCGGGCGG GCTCGGCGTC AAGCGCGCTG CGCTGGCGGA AGCCGCGCGG GGCGGGGCGG AGCGCGACGT CGTCGTGATG GTGGGGGACG GTTCGTACCT CATGATGCAC ACCGAGCTGG TCACCGCCGT CGCCGAACGG ATCAAGCTGA TCGTGGTCCT GATCCAGAAC CACGGCTACG CCTCGATCGG CTCGCTCTCC GAGTCCCTCG GCTCACAGCG CTTCGGCACC AAGTACCGCG CCCTCGACGG AGACCACCAC AGCTTCGACG AAGGCGAAAC CCTTCCGGTG GACCTCGCCC TGAACGCGCA AAGCCTGGGC GTGAAAGTGA TCCGGATCGA ACCGGGGGAG AAGGTCATCG CCGAACTCGA ACAGGCCATC AGGGACGCCA AAGCCGCCCC CGAGCGCGGC 
GGGCCCATCC TGATCCATGT CGAATCCGAC CCCCTGCTGG ACGCCCCGAG CTCCGAATCA TGGTGGGACG TTCCCGTCTC CCAGGTCTCC GAGCTCGAAT CCACGAGGCA GGCATTCCAG GCCTACACCG ACCACAAAAA CCGCCAGCGC AAGCTGCTCG GCTAA
|
Protein sequence | MGTRTVTAGT RRMTVAQAVV EYLSKQYTVD SVGGLDYRER LIPGTFGIFG HGNVAGVGQA LKQYQQLDPA IMPYYQGRNE QAQAHQAVGY ARHTRRRQTF AISTSIGPGS SNLLTGAALA TTNRLPVLLL PSDTFATRAA DPVLQQLEQP YAYDITVNDA FRPLSKFFDR VNRPEQLFSA FHHGLRVLTD PAETGAVTIS LPQDVQAEAL DVPEEFLAER EWRIRRPDAD DEDIRRAAEA IRAAKRPLII AGGGVLYAYA NDELARFVEL TGIPVGNTQA GVGVLPWDHK LSLGAIGSTG TTAANALAAE ADLIIGIGTR YEDFTTASRT AFQNPDVKFI NINVAAMDAY KHGTSLPIVA DARKALVKLN QALGGYRVGA DLEQQIAAEK KRWDATVDEA FDTRFTPLPA QNEIIGATSR AMDAQDVVVC AAGSLPGDLH KMWRVRDPFG YHVEYAYSCM GYEIPGGLGV KRAALAEAAR GGAERDVVVM VGDGSYLMMH TELVTAVAER IKLIVVLIQN HGYASIGSLS ESLGSQRFGT KYRALDGDHH SFDEGETLPV DLALNAQSLG VKVIRIEPGE KVIAELEQAI RDAKAAPERG GPILIHVESD PLLDAPSSES WWDVPVSQVS ELESTRQAFQ AYTDHKNRQR KLLG