Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1208 |
Symbol | |
ID | 4446304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1313854 |
End bp | 1316349 |
Gene Length | 2496 bp |
Protein Length | 831 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639689015 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_830702 |
Protein GI | 116669769 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTGA CTCCCCAGCG ACATCCCCCT CAGGAGACGG ACCGCACGCC TGAGGTCCTT GCCCCGTTAG CCGCACCGGC CGCGCGGCTG CGCCGCGCTG GTGCGCAGCC GTGGGCCATG GCGGCTGCCG TCGCCCTCGC TGTGTCCGGA GCCGCCCTAT CCCTTAACGG CGTACTGCGG GGCTGGGCGT GGTATCCGCC GGTGCTTACC ACCGTCTGCG TTGTTGCCCT AACCATGGCC CTCCTGAGGT CGCTGCGCGC GCAGCCATTC CTCGTGGCGC TGGGCGGCTT TGTGTCCCTC GCTTTCATCC TGACTTTCAC GTTCTTCCGC CGGGACAGCA TCGCCGGGTT CATTCCGTCC GGGGACACCC TGGAACAGCT GGGGAGGTAC CTGCGGCGGG CAAGTGAAAC CGTCCTGGCC GAGAGCTCCC CGGTTGCGCC GAATGCCGGC ATAGTGCTGG TCACGTGCGC GGTCCTGGGC CTCGTGGTGA TCCTGGTCGA TGCCCTCGCA GTGCCGCTGG CCCTGCCGGC AACTAGCGGG CTGGGACTGG TGGCCATCCT GGTGGTGCCC GCCATGGTCA AACCGCAGAG CGTCGGGGTG TGGGGGTTTG CCGCAGCAGC CGCCGGCTAC CTGATGATCC TTGCCTGCAG CCAGTGGTTC GCCCCGGACA ACCGCACTAC GGCGGACACC GCCCGCACTC CGGGACAGTT CCGCCGCGCT GCCCTCACCG GCGGCATCGC GCTGGTCATC ACGCTCCTCC TTCCCCTGGC CGTTCCGGGT TTCGACCAGG GCACGTTCCC ACAGGGTTCC AGGCTGAATC CTTGGGGTTC ATCCACCGGG CTCAATCCGA TGATCAGCCT GGGCAACAGC CTGCGGAGTC CCTCCGGCGA CGGACGCATC ACTTATGCCA CCAGTGCCAG CAGGCCCCCG TACCTGAGGT CCGTCACGGT GGACCGGTTC GACGGCGACT CCTGGTCACC GGACGACCGC GACGCTTCCC GGCGCGTGGG GGTCGGAAGG ATGGACGCCG GGTACGAGAT CCTCGCGGAC GAGCAGGTCC GCCAGGTCAC GGCGGTCAAT ACCGGCCGTT TCACCAGCCC GTACCTGCCG GTGCCGTACG CCCCGGATGC CGTCAACGGC CTGGCCGGAC GATGGAGCTG GGATCCGGCC ACGCTCAGCA TCAAGGGGGT GGACACCAAT TCACGGAACC AGCAGTACGT CGTGCAGTCG ACTGCACCCA GGCTCACGCC GGAACTGTTG TCACAGTCCT CGGCACCGGT GCGCGGGATC TCCGAGGAGT TCCTCCGGGC GCCCAACAAC GTGCCGGACA TTGTTCGGAG CACGGCGGAT AAAGTGACCG CCGGAAGCAG TACGGCGTAT GCCAAGGCGA TGGCCATCCA GAACTACCTG CGCTCGCTGG AGTTCACGTA TTCCTTGCAG TCCCCGGTCC AGGGCGGCTA CGACGGCAAC GGACTTTCCG TACTCGCGGA CTTCCTGACG CAAAAGAGCG GCTACTGCAT CCATTTCGCT TCGGCCATGG CCGTGATGGC CAGGCTGGAG GGAATCCCAA GCCGGATCGC CGTGGGCTAC GCCCCGGGCC GGTCCACGGG TGCCACGGTT TCGGTTCCCG GGCAAGGTGC ACTGACGGAA TACGAAGTCG ACTCCAGGGA CGCGCATGCC TGGCCCGAAC TCTATTTTCA AGGACTCGGA TGGGTGGCAT TCGAACCCAC GCCTTCACGC GGCGTGGTGC CGGACTACGC GTCGCAGACC TCCACGCCGG GGGGTGCCAG CACGAACGAA AACAACGACG ACCTCATCCC GTCGAACGCC GCAACGCCCG GCGCAACGCC AGGCGCCACC CCGACGCCGC TGCCGGGGTT GGAATCAGCC GGTGAACCAG GGCACCCGCT GACTCTGCCG CTCTACGGCA CCGCCGCTGT ACTCCTCCTT GCCCTGATCG CCGGGTCGCC GCGGATGGTG CGGGCCGGCC TGCGGTCGCG GCGGCTCAGG AGCGTGCCGG CCCTCGGGGG CAACACGGCC GCAGCGGCCT GGGCCGAACT GCGGGACCTG GCTACCGACT ACGGGCTGGC TCCCGGCCCC AGCGAAACCC CGCGTCACTT TTCCAGCCGG CTCAGTGGAT CGGGTGCGCT TGGCGCGCGC GACAACGCCG GAATCCGGGC TGTCGCCGCA CTCACCACGG ACTTCGAACG TCAGCAGTAC GGGCCGCCTG CCGGCCGTGC CGGGGGCGCT GAAGACGTCG CGGCGCCTTC AGCAGCAGAG AGGATAGCCG TGATCCAGGC ATCCCTGCGA AGGCATTCCC GCCGGGGACG GCAGCTGCGG GCGGAGTGGC TGCCGCCGTC GGTCATGAAT CGCTGGGGGC GCATCGTTTC CGCGCCCTTC CGTGCCGTCC TGCGGGCCGC CGCCAAACCC GCGCAGGCCG CCGCGCGGTC CTGGCGGACG GTGCGCGACG GCCTGCGGCG GCTGCGACGG GGCTAG
|
Protein sequence | MTLTPQRHPP QETDRTPEVL APLAAPAARL RRAGAQPWAM AAAVALAVSG AALSLNGVLR GWAWYPPVLT TVCVVALTMA LLRSLRAQPF LVALGGFVSL AFILTFTFFR RDSIAGFIPS GDTLEQLGRY LRRASETVLA ESSPVAPNAG IVLVTCAVLG LVVILVDALA VPLALPATSG LGLVAILVVP AMVKPQSVGV WGFAAAAAGY LMILACSQWF APDNRTTADT ARTPGQFRRA ALTGGIALVI TLLLPLAVPG FDQGTFPQGS RLNPWGSSTG LNPMISLGNS LRSPSGDGRI TYATSASRPP YLRSVTVDRF DGDSWSPDDR DASRRVGVGR MDAGYEILAD EQVRQVTAVN TGRFTSPYLP VPYAPDAVNG LAGRWSWDPA TLSIKGVDTN SRNQQYVVQS TAPRLTPELL SQSSAPVRGI SEEFLRAPNN VPDIVRSTAD KVTAGSSTAY AKAMAIQNYL RSLEFTYSLQ SPVQGGYDGN GLSVLADFLT QKSGYCIHFA SAMAVMARLE GIPSRIAVGY APGRSTGATV SVPGQGALTE YEVDSRDAHA WPELYFQGLG WVAFEPTPSR GVVPDYASQT STPGGASTNE NNDDLIPSNA ATPGATPGAT PTPLPGLESA GEPGHPLTLP LYGTAAVLLL ALIAGSPRMV RAGLRSRRLR SVPALGGNTA AAAWAELRDL ATDYGLAPGP SETPRHFSSR LSGSGALGAR DNAGIRAVAA LTTDFERQQY GPPAGRAGGA EDVAAPSAAE RIAVIQASLR RHSRRGRQLR AEWLPPSVMN RWGRIVSAPF RAVLRAAAKP AQAAARSWRT VRDGLRRLRR G
|
| |