Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0036 |
Symbol | |
ID | 4447506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 42175 |
End bp | 43791 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639687830 |
Product | aldehyde dehydrogenase |
Protein accession | YP_829537 |
Protein GI | 116668604 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTCA CCGGACATTC CCTGATCGCC GGGCAGGCCG TCGCCGGCGA AGGCAAGACT GCCTTTGGCT TCAACCCCGC CAGCAACGAA CAGCTTGAGC CCGCCTACAC CCTGCTCACC GAGGACCAGC TCAAAGCCGC CACCGCCGCG GCCGGCGAAG CCTACCCCTC CTTCAGCACA CTCGATCCCG AAACCCACGC GAGCTTCCTG GAAGCCATCG CGGACAACAT CGAGGCCATC GGCGACGACC TGATCGTCCG CGCCGGACAG GAGACCGGAC TGCCCGCAGC CCGACTACAA GGTGAACGTG CCCGCACCAC GGGGCAGCTC CGGCTGTTCG CGAACGTTGT CCGCCAGGGC GATTTCCGCG GCGTCCGCAT CGACCCGGCC CTGCCGGAAC GCACGCCGCT CCCCCGCGCC GACATCCGCC AGCGCCAGAT CCCGCTGGGA CCCGTGGCGG TGTTCGGTGC CAGCAACTTC CCGCTGGCCT TCTCGACGGC GGGCGGAGAC ACCGCTTCGG CCCTCGCCGC CGGCTGTCCC GTAGTCTTCA AAGCCCACAA CGCCCACCCC GGCACGGGCG AACTTGTCGG CCAGGCCATC GTCAAAGCCG TCCGCGATTC CGGGCTCCAC CCTGGCGTGT TCTCGCTGAT CTACGGCCCC GGCAGCAGCA TCGGCCAGGC CCTTGTGGCG GACCCGGCCA TCAAGGCTGT GGGCTTCACC GGCTCGCAGA GCGCCGGCAT TGCGCTGATG CGCACCGCAG CAGCCCGCCC GGAGCCCATC CCGGTCTACG CGGAAATGTC CTCGCTCAAC CCGGTCTTCG TGTTCCCCGG CGCCCTCACC GGCTCCGCCG AGCAGATCGA CGCACTGGCG CAGCAGTACG TCACCGCCGT CACCGGCAGC TCCGGACAGC TCTGCACCTC CCCCGGCCTG CTGTTCGCCC CCGCAGGTGA GCTGGGCGAC AAACTGGCTG CCGCCGTCGG ACGCGCAGTA TCCGCCTGCG CCGGCCAGAC CATGCTGACC GCCGGCATCG CCGGTTCGTG GAACAGCGGG GCCGAGACGC TCGGCTCAGC CGACAACGTG ACCGTCGTCG GCCAGGGAAC CGCCGGACCC ACCGAAAACG CACCGGCCCC CACCATCTTC GGGACCGACA TCGCCGACTT CGTCAGCAAC CATGTCCTGC ACGCCGAGAT CTTCGGCGCG GCCAGCCTGG TGATCCGCTA CTCCACCGCC GGGGAACTGA TCGAGGCCAC CAACCGGCTC GAGGGGCAAC TCACCGCATC CCTGCAGCTC ACCGAAGAGG ACTACCCGAC GGCGGCGCAA CTGCTGCCCG CCCTGGAACA GAAGGTGGGG CGGATCATCG TCAACGGTTG GCCCACCGGC GTCGAAGTGG GTCACGCCAT GGTCCATGGC GGCCCCTTCC CGGCGACGTC GGACACGCGG ACGACGTCGG TCGGCACCCT GGCGATCAAC CGATTCCTCC GGCCGGTCGC CTACCAGAAC CTGCCCCAGG AACTGCTCCC GGCTCCGCTG CAGGACGCCA ACCCGTGGCA CCTGAACCGC CGGATCGACG GCACGGTCGA AGCCGCAGCC GACGCAGAAG ATAAGGTCAA CGCATGA
|
Protein sequence | MTLTGHSLIA GQAVAGEGKT AFGFNPASNE QLEPAYTLLT EDQLKAATAA AGEAYPSFST LDPETHASFL EAIADNIEAI GDDLIVRAGQ ETGLPAARLQ GERARTTGQL RLFANVVRQG DFRGVRIDPA LPERTPLPRA DIRQRQIPLG PVAVFGASNF PLAFSTAGGD TASALAAGCP VVFKAHNAHP GTGELVGQAI VKAVRDSGLH PGVFSLIYGP GSSIGQALVA DPAIKAVGFT GSQSAGIALM RTAAARPEPI PVYAEMSSLN PVFVFPGALT GSAEQIDALA QQYVTAVTGS SGQLCTSPGL LFAPAGELGD KLAAAVGRAV SACAGQTMLT AGIAGSWNSG AETLGSADNV TVVGQGTAGP TENAPAPTIF GTDIADFVSN HVLHAEIFGA ASLVIRYSTA GELIEATNRL EGQLTASLQL TEEDYPTAAQ LLPALEQKVG RIIVNGWPTG VEVGHAMVHG GPFPATSDTR TTSVGTLAIN RFLRPVAYQN LPQELLPAPL QDANPWHLNR RIDGTVEAAA DAEDKVNA
|
| |