Gene Information |
Locus tag | Arth_1894 |
Symbol | |
ID | 4445583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2131507 |
End bp | 2133000 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689706 |
Product | betaine-aldehyde dehydrogenase |
Protein accession | YP_831378 |
Protein GI | 116670445 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
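The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced here for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed present at the end).
    return gene_len_bp // 3 - 1


length = gene_length(2131507, 2133000)
print(length)                           # 1494
print(expected_protein_length(length))  # 497
```

Both values agree with the Gene Length (1494 bp) and Protein Length (497 aa) fields. The strand does not affect this calculation, because the length depends only on the span between the two coordinates.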
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0153798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAATCC AACACGCACC CGAGGCATCA CCTGCCACGA CCTGGATCCC CGGAAAGCAG TTCATCAACG GTAACTGGCG CGATGGCGGA TCAGAGAACA TCCTGACCGT CACCAACCCG TACAACGGTG AAACGCTGAC CACAATCCGC CAAGCCAATC TGGGCGATCT TGACGAAGCT TACGAAGCCG CCGCACTGGC GCAAATTGAC TGGGCGGCCC AGAGTCCGGC TGCACGTCAG CAGGTGCTGC TACGGGCGGC CCAAATCCTT GAGGAACGCC GTGAAGAAAT CACCTCCTGG CTGATCGCCG AATCCGGCAG CACCGTGCTG AAAGCCAACA TTGAACTTGG CGCCGCGCAA GGAATCACCG TCGAGTCAGC ATCCTTCCCC CACCGCGTCC AAGGCCGGAT TCTCGAATCA AATACCAAAG GCAAGGAAAA TCGCGTGTAC CGCCGGCCGC TCGGCGTCGT CGGAGTAATC AGCCCGTGGA ACTTCCCCCT CCATCTCACT CAGCGCTCCA TCGCACCCGC ACTGGCACTC GGCAACGCCG TGGTGATCAA ACCAGCCAGC GACACACCCG TGAGTGGCGG CCTCATCCTA GCCAGAGTCT TCGAAGAGGC CGGCCTGCCG CCGGGCGTGC TGAACGTCGT CGTCGGCGCA GGTTCGGAAA TCGGTGACGC CTTCGTTGAG CACCACGTAC CATCCTTCAT TTCCTTTACC GGATCAACAC CTGTGGGACA AAACGTTGCC AGACTTGCCG CCACCGGATC GCATCTTAAG CACGTGGCAC TGGAACTCGG CGGTAACAGC CCATTCGTCG TTCTCGCCGA CGCCGACCTG GACCAGGCGG TTCGGGCCGC CGCCATGGGC AAGTTCCTCC ATCAGGGCCA GATCTGCATG GCCATCAACA GAATCATCGT CGAAGACGGC ATCTACGACG AATTTCTCAC CCGTTTCGCC GACCACGTCC GGACGTTGCC GATAGGCGAC CCGGCCGACC CGGCCACGGT CATCGGGCCG GTGATCAACA GCAAGCAACG CGAAGGCTTG GAGAAGAAAA TAGCCAGGGC GGCCGAAGAA GGAGCCAGGA CCCTGCTGGG CGGTCCGGCG GCTGGCCAGG TCCTGCCGCC ACACATCTTT GCCGATGTCA CAGCGGAAAT GGAGATAGCC CGTGAGGAAA TCTTCGGACC ACTCGTCGGG ATCCTCAAAG CCCGAGACGA ACAACACGCC CTGGAATTGG CCAACGACAG TGACTTCGGC CTCTCCAGCG CCGTTTTCAC TGAAAGCACG GAACGCGGCG TCAGGTTCGC CCGCGGGCTA AAGGCAGGAA TGACCCACGT CAATGACATT CCCGTCAACG ACGAACCGCA TGTGGCATTC GGGGGCGAAA AGAGCTCTGG GCTCGGACGC TTCAACGGCG AGTGGGCCAT CGACGAATTC ACCACCGACC ACTGGGTCAG CGTGCAGCAC GAAGCACGCC AGTACCCCTT CTAA
|
Protein sequence | MTIQHAPEAS PATTWIPGKQ FINGNWRDGG SENILTVTNP YNGETLTTIR QANLGDLDEA YEAAALAQID WAAQSPAARQ QVLLRAAQIL EERREEITSW LIAESGSTVL KANIELGAAQ GITVESASFP HRVQGRILES NTKGKENRVY RRPLGVVGVI SPWNFPLHLT QRSIAPALAL GNAVVIKPAS DTPVSGGLIL ARVFEEAGLP PGVLNVVVGA GSEIGDAFVE HHVPSFISFT GSTPVGQNVA RLAATGSHLK HVALELGGNS PFVVLADADL DQAVRAAAMG KFLHQGQICM AINRIIVEDG IYDEFLTRFA DHVRTLPIGD PADPATVIGP VINSKQREGL EKKIARAAEE GARTLLGGPA AGQVLPPHIF ADVTAEMEIA REEIFGPLVG ILKARDEQHA LELANDSDFG LSSAVFTEST ERGVRFARGL KAGMTHVNDI PVNDEPHVAF GGEKSSGLGR FNGEWAIDEF TTDHWVSVQH EARQYPF
|
| |
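The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch that uses only the Python standard library, with hypothetical helper names (`clean`, `gc_fraction`, `translate`). It assumes that the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) and that internal codons under translation table 11 map to the same amino acids as the standard code. It does not handle alternative start codons, which this gene does not need.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 uses these same amino acid assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence shown in this record.
prefix = "ATGACAATCC AACACGCA"
print(translate(prefix))  # MTIQHA
```

The translated prefix matches the first six residues of the listed protein sequence (MTIQHA). Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported 61% GC content.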