Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1525 |
Symbol | |
ID | 4445959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1698169 |
End bp | 1699689 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689339 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_831019 |
Protein GI | 116670086 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.320923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAAGG AACACCACGT CCGCGTTTAC AAGAGCGAGG AAAACCTGCC CCGCGAGGAC CAGCTCGCGT ACAAGATCGC CAAGGTTGCC ACCGATCCTG TCGCCGTCAC CGACGAGGTT ACCGACATGG TGATCAACCG GGTCATCGAC AACGCCTCGG TGGCCATCGC CTCCCTGAAC CGGGCCCCCA TCGTTGCCGC CCGCGCACAG GCACTCACCC ATGGACCCAC CACGGGCGGC AAGGGATCCA AGGTCTTCGG CATCGACGAG CGCGTGGCAC CGGAATGGGC CGCCTGGGCC AACGGCGTTG CTGTCCGTGA ACTCGACTAC CACGACACCT TCCTGGCAGC GGACTACTCC CACCCCGGCG ACAACATCCC GCCGATCCTC GCCGTCGCCC AGCACGTCGG CTCCAGCGGA CACGACCTCA TCCGCGGCAT TGCCACCGGC TACGAAATCC AGGTGAACCT GGTCAAGGCC ATCTGCCTGC ACAAGCACAA GATCGACCAC GTGGCCCATC TCGGCCCCTC CGCCGCTGCC GGTATCGGCA CCCTCCTCGG GCTCGACGTC GAGACCATCT TCCAGTCCGT GGGCCAGGCC CTTCACACCA CCACCGCCAC GCGCCAGTCG CGCAAGGGCG AGATTTCCAC CTGGAAGGCG CACGCCCCGG CCTTCGCCGG GAAGATGGCC GTAGAAGCCG CGGACCGCTC CATGCGCGGC CAGACGTCGC CGGTCCCGAT CTATGAGGGT GAAGACGGCG TCATCGCCTG GATGCTGGAC GGCCCCGAGG CTTCCTACAT GGTGCCGCTG CCGACTCCCG GTGAAGCCAA GCGCGCCATC CTGGACACGT ACACCAAGGA ACACTCCGCC GAGTACCAGG CCCAGGCCTG GATCGACCTC GCCCGGAAGC TGCACGGCGA GCGCCCCGAA GTCACCGACC CGGCCAACGT GAAATCCGTG CTGATCAAGA CCAGCCACCA CACCCACTAC GTGATTGGTT CCGGCGCCAA CGACCCGCAG AAGTACAGCC CCACCGCGTC CCGTGAAACG TTGGACCACT CGATCCCGTA CATCTTCACC GTCGCGCTGC AGGACGGCGC CTGGCACCAT GTGGACTCCT ACGCCCCGGA GCGGGCCGCG CGCCCCGACA CCATGGAGCT GTGGCACAAG GTCACCACGG TGGAGGACCC GGAATGGACC CGCCGCTACC ACTCCCTCGA CATCGCTGAG AAGGCTTTCG GCGGTACCGT CGAAATCACG CTCAACGACG GCACTGTCAT TACCGACGAG ATCGCCGTCG CCGACGCCCA CCCGCTGGGT GCCCGGCCGT TCGCCCGGGA GCAGTACGTC AACAAGTTCT GCACACTCGC AGCCGGCCTC GTCGCCGAAG ACGAGATCGA ACGGTTCCTT GCCGCAGCGG CCCGGCTCCC GGAACTGGCC GCCGGGGAAC TGGACCAGCT GAACATCAAG GCGGCCGACG GCGTGATCGA CCTCGCTGCC GCACCGAAGG GACTTTTCTA A
|
Protein sequence | MVKEHHVRVY KSEENLPRED QLAYKIAKVA TDPVAVTDEV TDMVINRVID NASVAIASLN RAPIVAARAQ ALTHGPTTGG KGSKVFGIDE RVAPEWAAWA NGVAVRELDY HDTFLAADYS HPGDNIPPIL AVAQHVGSSG HDLIRGIATG YEIQVNLVKA ICLHKHKIDH VAHLGPSAAA GIGTLLGLDV ETIFQSVGQA LHTTTATRQS RKGEISTWKA HAPAFAGKMA VEAADRSMRG QTSPVPIYEG EDGVIAWMLD GPEASYMVPL PTPGEAKRAI LDTYTKEHSA EYQAQAWIDL ARKLHGERPE VTDPANVKSV LIKTSHHTHY VIGSGANDPQ KYSPTASRET LDHSIPYIFT VALQDGAWHH VDSYAPERAA RPDTMELWHK VTTVEDPEWT RRYHSLDIAE KAFGGTVEIT LNDGTVITDE IAVADAHPLG ARPFAREQYV NKFCTLAAGL VAEDEIERFL AAAARLPELA AGELDQLNIK AADGVIDLAA APKGLF
|
| |