Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2629 |
Symbol | |
ID | 4444716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2947823 |
End bp | 2948866 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690449 |
Product | peptidase M4, thermolysin |
Protein accession | YP_832108 |
Protein GI | 116671175 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.235261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTGCT CCATCATCCC GCCCTACATG CTGCGGCGCC TTGCGGCACA GCACGAACCC CGCCTGTACG CGGTGGCCCG GGCGGCGAAG GAAGCCCTGC TTCATATCAG GTCCATCCAG GCCAGCCGCG CGCTTCCGCT GCCGGCAGTC CCGTCCGGTA TCCGGCAGCT GAAGCCAGGT CCGCCGAACC GCACCGTGTA TGACGCCGGG ACCTCCGAGT CCCTTCCCGG CCAGCTGGTC CGCAGGGAGG GTGAACCCGC CACGGGAGAC CCCTCGGCGG ATGAGGCCTA CGACGGACTG GGACACACCC ACCGGCTTTA CGCAGAGGCC TTCGGCCGGA ACTCGATCGA CGGCCAGGGG CTCGCCCTGG ATGCCAGCGT GCATTTTGGG AAGCTCTACG ACAACGCCTT CTGGGACGGA CACCAGATGG TGTTCGGGGA CGGCGACGGC GAAATCTTCC AGCGTTTCAC TCGGTCCCTG AGCGTCATTG GCCACGAGCT GGCGCACGGC GTGACGCAGC ATTCCGCCGG ACTCGTCTAC CGGAACCAGG CCGGTGCCAT CAACGAATCG CTGTCGGACG TGTTCGGCGC ACTGGTGGAA CAGTACCTCG CGCAGCAGTC GGCATCGGAG GCCAGCTGGC TGATCGGGGA AAACCTGTTC ACCGACAAGG TACAAGGCTC AGCCCTGAGG TCGTTGAAGG CTCCCGGCAC CGCGTACGAC GACGACGTGC TGGGCAAGGA TCCCCAGCCG GACTCCATGG ATTCCTACGT CCGGACCAGC GCGGACAACG GCGGCGTGCA CATCAACTCC GGCATCCCCA ACCGCGCGTT CTACCTCGTG GCGTCGGCAC TTGGCGGCAA CGCCTGGGAA ACCCCGGGCC GGATCTGGTA CGACGCCCTT ACCGGAGGTT CGCTGCCTGC CAACGCAACG TTTACCAAGT TCGCCAAGGC CACGGCCGCG TCCGCCGCAG AACTGTTCGG CGCCGCTTCC CCGGAACATG ACGCCGTCCG GAAGGCGTGG GAAACTGTGA AGGTAAAGCT GTAA
|
Protein sequence | MYCSIIPPYM LRRLAAQHEP RLYAVARAAK EALLHIRSIQ ASRALPLPAV PSGIRQLKPG PPNRTVYDAG TSESLPGQLV RREGEPATGD PSADEAYDGL GHTHRLYAEA FGRNSIDGQG LALDASVHFG KLYDNAFWDG HQMVFGDGDG EIFQRFTRSL SVIGHELAHG VTQHSAGLVY RNQAGAINES LSDVFGALVE QYLAQQSASE ASWLIGENLF TDKVQGSALR SLKAPGTAYD DDVLGKDPQP DSMDSYVRTS ADNGGVHINS GIPNRAFYLV ASALGGNAWE TPGRIWYDAL TGGSLPANAT FTKFAKATAA SAAELFGAAS PEHDAVRKAW ETVKVKL
|
| |