Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3332 |
Symbol | |
ID | 4444061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3743898 |
End bp | 3745013 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691155 |
Product | hypothetical protein |
Protein accession | YP_832807 |
Protein GI | 116671874 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG CTACCCAACA GGCACAGCAG GCGACCACCG GCACCACCAC GGGACCGTGG AACCCCGCAG ACCGGGCCGT CAAGCGCCGG CGGGTCCTGG ACATCCTGGA TGCCGCGGGC AGGGACTCCC TGCTCCTGAC CACCAACACG GCGCTGACCT GGTACCTGGA CGGGAGCCGC GTCCACATCA GCCTCGCCGG CGACCCCGTC GCCGCCATGC TGGTCGACCG TGACGGTGAC CATCTGGTCA CCTACAACAA CGAAGCCGCC CGGATTGCGG CAGAGGAACT GCCCGACGGC GTGAACCTCC ACACGGTGCC CTGGCACGGG CAGCTGCACG CCGCCGCAGC CATGCTCGCA CCTGACGGAA GGCCCCTTGC GGAGACTGAT GTGGCCGCCG AGCTGAGGAC CGCCCGCCAG CCGTTCCTGC CCGGCGAGAG CGCCCGGTAC GCCCGGCTGT GCGCCGATGC CGCTGCAGCG ATGACAGCCG TCCTTTCCGG CACCACCCCG GAAACCACCG AGTTCGCTGT GGCCTCCGCC CTGGCTGCAC GGATCGTGGC GATGGGGGCC GAGCCGCTGG TGCTTCTGTG CAGCGGCGCC GGACGCAGCG GGTTCCGGCA CCCGCTGCCT ACCCACGCGC CGATCGGCCG GCGGGCCATG GCGGTGGTGT GCGCGCGGCG CAACGGACTG GTGGCCAATG TGACCCGCTG GGTGCGGTTT GACGCCGGAA CCCCGGGCGA ACTCGACGCC GAAGCCCGGA TTGCCGCAGT AGAAGCGGAC ATTTTCGACG CCACTGTGCC CGGTGCACGG TTGGACGGCA TCTTCGCTGA AATCCAGGAA GCCTACCTCC GCCACGGCTT CGGCGCAGAC CAATGGACCC TCCACCATCA GGGCGGCCCG GCCGGTTACG CGGGCCGCGA TCCCCGGGCG ACGCCCGGCA CCGACGATGC CGTGGTCCTC AATCAGACCT TCACCTGGAA TCCTTCCGGT CCCGGAGTGA AGATCGAAGA CACGGTCCAG CTGACGGAGA CGGGGATCAC CGTCCTCAGC GTGGACCCGA ACTGGCCGGC CGCCGTCGTT AACGGCATCC GGCGGCCGCT GACCCTGGAG CTGTGA
|
Protein sequence | MNTATQQAQQ ATTGTTTGPW NPADRAVKRR RVLDILDAAG RDSLLLTTNT ALTWYLDGSR VHISLAGDPV AAMLVDRDGD HLVTYNNEAA RIAAEELPDG VNLHTVPWHG QLHAAAAMLA PDGRPLAETD VAAELRTARQ PFLPGESARY ARLCADAAAA MTAVLSGTTP ETTEFAVASA LAARIVAMGA EPLVLLCSGA GRSGFRHPLP THAPIGRRAM AVVCARRNGL VANVTRWVRF DAGTPGELDA EARIAAVEAD IFDATVPGAR LDGIFAEIQE AYLRHGFGAD QWTLHHQGGP AGYAGRDPRA TPGTDDAVVL NQTFTWNPSG PGVKIEDTVQ LTETGITVLS VDPNWPAAVV NGIRRPLTLE L
|
| |