Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1091 |
Symbol | |
ID | 4446429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1178910 |
End bp | 1179896 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639688897 |
Product | hypothetical protein |
Protein accession | YP_830585 |
Protein GI | 116669652 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3591] V8-like Glu-specific endopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000677339 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAT CGAAGACTCT GGCCACCAGC CTCCTGAGCC TGTCCGCCGC CGCGCTGCTG GCCGTCTGCG CAGCCGGCGG GGCTAGTGCG GCGCCCGCAT CACAGAACGC CTCACCACAG GACGGCGTTC AGGTTGCCAG CCAGTCGGTG GACACCACTA ACGCCGCCGA CTACTGGACG GCCGACCGCA TGCGCGCAGC GGTTCCCGGG GACGTACTGG CCGCCAAGGC GCTGCAGCGC GGCAACACCT CCTCAGCCGC GGCGGTGGAA AAGGGTTCGA GTACCAAAAT CACGGGCAAG GCAGGCAAGG GAAAGACCGT CCTGCACGTC GACGAGAATC CGGTGTCCCA CATTGGCAAG GTCTTCTTCA CCATGGGCGG CAGCAACTAC GTCTGCTCCG GGAACTCCGT GGTGTCCAAC AACAAGAGCA CCGTGTCCAC AGCCGGCCAC TGCGTCAATG AGGGTCCCGG CGCCTTTGCC ACCAACTTCG TCTTTGTCCC GGCCTACCTG GACGGCGCCG CGCCGTACGG CAAATGGGCA GCCAAGGCCC TGTACACCCC CACCCAGTGG AGCTCGGCCG GAGACATGCA GTACGACACA GGCTTCGCCG TCGTCTCGCA GCTCAACGGC CAGAGCCTGG CCGACGTGGT GGGCTCATCC GGAGTCCAGT TCAACGCAGC GCGCGGGCTG ACCTACAAGT CCTACGGCTA CCCGGCCGCG GCCCCGTTCG ACGGACAGTC GCTGGTCAGC TGCACCGGCC CGGCCAGCGA TGACCCGTAC AACCCCCAGT TCAATACGCA GGGCATCCCG TGCGACATGA CGGGCGGTTC CTCAGGCGGT CCGTGGTTCA TCGGCACAAG CTCCAGCGGC TATCAGAACT CCATCAACAG CTACGGCTAC AGCGGCGCCC CCTCCAAGGT CATGTATGGA CCGTACTGGG GTTCCGTCAT CCAGCAGGCG TACTCGAGTG CATCGTCGGC GAACTGA
|
Protein sequence | MTKSKTLATS LLSLSAAALL AVCAAGGASA APASQNASPQ DGVQVASQSV DTTNAADYWT ADRMRAAVPG DVLAAKALQR GNTSSAAAVE KGSSTKITGK AGKGKTVLHV DENPVSHIGK VFFTMGGSNY VCSGNSVVSN NKSTVSTAGH CVNEGPGAFA TNFVFVPAYL DGAAPYGKWA AKALYTPTQW SSAGDMQYDT GFAVVSQLNG QSLADVVGSS GVQFNAARGL TYKSYGYPAA APFDGQSLVS CTGPASDDPY NPQFNTQGIP CDMTGGSSGG PWFIGTSSSG YQNSINSYGY SGAPSKVMYG PYWGSVIQQA YSSASSAN
|
| |