Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2171 |
Symbol | |
ID | 4445197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2444842 |
End bp | 2446053 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639689980 |
Product | peptidase M50 |
Protein accession | YP_831651 |
Protein GI | 116670718 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.121465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCATT CCGGAGCCAC GGCCTTGACT GACCCGGCGG GCGGCAGCGA CCCGAAGGTT GCCAATGGCC GCAGGGAAGG CATATCGCTG GGCCGCATCG CGGGTATCAG GGTAGTCCTG GCGTATTCGT GGTTCATCAT TGCTGCGTTC ACCGTGATTG TGTACGGACC GGTGCTGGAG GGGCAGTACC CGGACATGGG AATCACGGCC TACTACGTGG CCTTCGCTTA TGCCCTGCTC CTGCTCCTCT CAGTACTTGT CCACGAGCTC GCCCACGCCC TGACGGCCAA GATCTTCCAC TGGCCCACCG AGAAAATCGT GCTCAACCTC TGGGGCGGCC ACACGCAGTT CGAGGGCTTC ACCGCAACGC CCGGACGCTC GGTGCTGGTG GCGATGGCCG GGCCTGCTGC GAACCTGGCA CTGGCCGGTG CCGGCTGGCT GTTCATCCTG GCAACGGATC CCTCCGGCGT GGCGGGAATC CTGTCCAACA TCTTTGTCTG GGCGAACCTG CTCATCGGCA TCTTCAACGT CCTTCCCGGG CTGCCGCTCG ACGGCGGCCG CCTCGTTGAG TCCGCTGTCT GGAAAGCCAC CGGAAGCCAG GAAAAAGGCA CCGTGGCCGC CGGCTGGGCA GGGCGCATCA TTGTCATTGC CCTAGCGGTC TGGTTCGTCC TGCTTCCGCT GGCGCGCGGC GACCGCCCCG ACGTTTCACT GATGCTCATC ACCGTGCTGG TGGGCAGCTT CCTGTGGATG GGGGCGTCGG CCTCCATCCA GCACGGAAGG CTTCGCAGCC GGCTGCACCT TGTCAACGCA GCGGCCCTCG CGGAACCAGC GGTCGGAATA CCCGAATCCT CCACCGTGGC CGACGTTCTG CGGCTTGCTC CGCAAGGCAC CCCTGCCATC GTTCTCTGCG GATCGGACGG ACGTCCGGCC GGAATAGTCT CCGATGTCGC CGCCGCCTCC GTTCCCGCAG GTGCCGCAGC AACAACCCCC GCCACCGCAG TGGCGCACGC CCTCAGCGCC GGCGCCTACG TGCCCGAATG GTCCCAGGGC CAGGAGCTGG TGCAGTACCT TGCTCAGCTT GAGGGCCACG AGTATGCCGT GGTGGACCAC CACGGAAAGG TGACCGGTCT GCTCCGCCAG CAGGCCGTGG TGACCGCCAT TACAGGCAAA GAAATGCGCC GCGGCGGGCG CGCCAAGGCG CAGAACCGGT AG
|
Protein sequence | MQHSGATALT DPAGGSDPKV ANGRREGISL GRIAGIRVVL AYSWFIIAAF TVIVYGPVLE GQYPDMGITA YYVAFAYALL LLLSVLVHEL AHALTAKIFH WPTEKIVLNL WGGHTQFEGF TATPGRSVLV AMAGPAANLA LAGAGWLFIL ATDPSGVAGI LSNIFVWANL LIGIFNVLPG LPLDGGRLVE SAVWKATGSQ EKGTVAAGWA GRIIVIALAV WFVLLPLARG DRPDVSLMLI TVLVGSFLWM GASASIQHGR LRSRLHLVNA AALAEPAVGI PESSTVADVL RLAPQGTPAI VLCGSDGRPA GIVSDVAAAS VPAGAAATTP ATAVAHALSA GAYVPEWSQG QELVQYLAQL EGHEYAVVDH HGKVTGLLRQ QAVVTAITGK EMRRGGRAKA QNR
|
| |