Gene Information |
Locus tag | Arth_3021 |
Symbol | |
ID | 4444388 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3388517 |
End bp | 3389647 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690845 |
Product | aspartate-semialdehyde dehydrogenase |
Protein accession | YP_832500 |
Protein GI | 116671567 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0136] Aspartate-semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01745] aspartate-semialdehyde dehydrogenase, gamma-proteobacterial |
|
|
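The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Arth_3021.
length_bp = gene_length(3388517, 3389647)
print(length_bp, protein_length(length_bp))
```

With the record's coordinates this reproduces the listed Gene Length of 1131 bp and Protein Length of 376 aa.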
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.458955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACAG CAGCTACCCC CTCCGTCGGC CTGGTCGGAT GGCGCGGCAT GGTCGGCTCC GTCCTGATGC AGCGCATGCA GGATGAAGGC GACTTCGCCA GCATCAACCC GGTGTTCTTC TCCACCTCCA ACGCCGGAGG TGCGGCGCCA TCTTTTGCGG AAGGCGCAGG CAAGCTCGAG GACGCGTTCG ACGTCGACAC GCTGGCTAAG CTGCCGATTA TTGTTACCGC CCAGGGCGGC GATTACACCA AGCAGGTCCA CGGTGAGCTG CGCAGCCGCG GCTGGGACGG CCTCTGGATC GACGCCGCCT CCACCCTGCG AATGAATGAC GACTCGATCA TCGTGCTGGA CCCCATCAAC CGCGACGTCA TCGACAAGGG CCTCGCCAAC GGCACCAAGG ACTTCATCGG CGGAAACTGC ACCGTGTCCT GCATGCTGAT GGGCCTGGGC GGCCTGTTCA AGAACGGCCT CGTCGAGTGG GGCACCTCCA TGACCTACCA GGCTGCCTCC GGCGGCGGCG CCCGCCACAT GCGCGAGCTG CTCAGCCAGT TCGGTACGCT CAACGCGGAG GTCAGCTCGG AACTGGACGA CCCGGCGTCG GCCATCCTGG AAATTGACCG CAAGGTCCTG GCACACCAGC GCACCGACAT CGACGCGACG CAGTTCGGCG TCCCGCTGGC CGGCTCGCTG ATCCCTTGGA TCGACGCGGA CCTCGGCAAC GGCCAGTCCA AGGAAGAGTG GAAGGCCGGG GTCGAGACCA ACAAGATCCT CGGAACCTCC GGCGAAAACC ACATCATCAT GGACGGCCTG TGCATCCGGA TCGGTGCGAT GCGTTCGCAC TCCCAGGCCC TCACGCTCAA GCTCCGCGAA GACCTCTCCG TGGCCGAGAT CGAGAAGCTC CTGGACGCGG ACAACGAATG GGCCAAGGTG GTGCCCAACA CCAAGGAAGA CTCCATGGCG AGCCTGACCC CGGTGGCCGC CTCAGGGACG CTGGACATCC CGGTGGGCCG TATCCGCAAG CTCGAAATGG GCCCGGCTTA CATCAGCGCC TTCACTGTGG GTGACCAGCT CCTCTGGGGC GCCGCCGAAC CGCTGCGCCG CATGCTCAAC ATCGCCACCG GCACGCTCTA A
|
Protein sequence | MTTAATPSVG LVGWRGMVGS VLMQRMQDEG DFASINPVFF STSNAGGAAP SFAEGAGKLE DAFDVDTLAK LPIIVTAQGG DYTKQVHGEL RSRGWDGLWI DAASTLRMND DSIIVLDPIN RDVIDKGLAN GTKDFIGGNC TVSCMLMGLG GLFKNGLVEW GTSMTYQAAS GGGARHMREL LSQFGTLNAE VSSELDDPAS AILEIDRKVL AHQRTDIDAT QFGVPLAGSL IPWIDADLGN GQSKEEWKAG VETNKILGTS GENHIIMDGL CIRIGAMRSH SQALTLKLRE DLSVAEIEKL LDADNEWAKV VPNTKEDSMA SLTPVAASGT LDIPVGRIRK LEMGPAYISA FTVGDQLLWG AAEPLRRMLN IATGTL
|
| |
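The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction and a stop-codon check on the result. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA); the example strings are short fragments, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases of a cleaned sequence form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence, as printed in the record.
head = clean_sequence("ATGACTACAG CAGCTACCCC")
tail = clean_sequence("GCACGCTCTA A")
print(head, ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the GC content field of the record.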