Gene Information |
Locus tag | Arth_4274 |
Symbol | |
ID | 4443442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | + |
Start bp | 6788 |
End bp | 7879 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639687595 |
Product | phage integrase family protein |
Protein accession | YP_829292 |
Protein GI | 116662238 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
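The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. Both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 6788
END_BP = 7879
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

print(computed_length == GENE_LENGTH_BP)       # True
print(computed_protein == PROTEIN_LENGTH_AA)   # True
```

Under those assumptions, 7879 - 6788 + 1 = 1092 bp, which is 364 codons, or 363 residues once the stop codon is set aside.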
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTGG TGGCGTTGCC GGGTGGCGGC GCCGTGGATC CAGCGTTCTG GAGTGTGCAG GCAGCCGAGG ACTTTGAACA GGAGATCGTC GACCAGTACG CGCTGGCGAT GGCCGCGGCT GGTCTGAGTG ACTCGCATAT CCGCAATACG CGGTCGACAA TCATCGAATT CGCCCGTTCG GTGACCGGCC CGCTGTGGGC GGCAACGTGT CAGGATGCCG ACCGGTTCCT GGCCGAGCAG AGACGGCTGG GCCGCAGCGT GAGCACCAGG GCCGGCAAGG CAGGGACGCT GGCGTTGTTT TACGAGTTCA TGATCAGCCG GTACCAGGGC CGGATCCACC GGTTGACCGG GGTACTGGTC GAGCAGCCTA TCGACGAGTT CAACCGCCAG GCCGGGGCAT CGCTGGGCAA GGTCCGTGTG CCGCCGTCGG ATGCCGAGAT TGACGCGTTC TTCACCTGTT GGAGGCACTC GATACCCCAG GCCCGCAAGT ATTTGCCTGC CGCGCGGGAT TACTTCGCTG CTTCGCTGTG GCGCCGGCTG GGGTTGCGGA TCACCGAGAC GGTGATGCTC GACATCCGTG ATTGGCGCCC TGACCTGGGC GGGTTCGGCA AGCTCCACGT GCGGTACGGC AAAGGAGCCC ACGGCCGTGG CCCCAAGCCG CGCCTCGTCC CGGCTATCAA CGGCGCGGCC GAGCTGATCG ACTGGTGGCT GGGCGACGTC CGGCACCGGT ACGGCGAGGA CTGGGCCGAC CCCGACGCAC CCCTGCTCCC CTCGGAACGG TTTGACCGTG AGCTGGGACG ATGCGGCCGG GTCGGTGGCA ACGCGCTGCG GCGAAGTCTG GGGCTGCAGG TCGACCAGTG GCTGCCGGCA TGGTCCGGAA GGATGACTCC CCATGTTCTG CGTCATTACT GCGCTTCCTC GCTCTACGGG GCAGGGATGG ACATCAAGGC CCTCCAGGAG CTGCTTGGGC ATCAGTGGCT CTCGACTACC TCGGGCTACA TCCACGTGCG CAGCGAGCAC GTCGAGCAGG CCTGGAAGAA CGCCAACGAG CGGGTCGAGT CCCGCTTCGC GACCACACAG AAGGAAGGAT GA
|
Protein sequence | MALVALPGGG AVDPAFWSVQ AAEDFEQEIV DQYALAMAAA GLSDSHIRNT RSTIIEFARS VTGPLWAATC QDADRFLAEQ RRLGRSVSTR AGKAGTLALF YEFMISRYQG RIHRLTGVLV EQPIDEFNRQ AGASLGKVRV PPSDAEIDAF FTCWRHSIPQ ARKYLPAARD YFAASLWRRL GLRITETVML DIRDWRPDLG GFGKLHVRYG KGAHGRGPKP RLVPAINGAA ELIDWWLGDV RHRYGEDWAD PDAPLLPSER FDRELGRCGR VGGNALRRSL GLQVDQWLPA WSGRMTPHVL RHYCASSLYG AGMDIKALQE LLGHQWLSTT SGYIHVRSEH VEQAWKNANE RVESRFATTQ KEG
|
| |
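The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do this for a sequence written in space-separated blocks of ten, as in this record. The helper name gc_percent is hypothetical, and the example input is only the first ten bases of the gene sequence, so its result is not the whole-gene figure reported above.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100 * gc_count / len(bases)


# First block of the gene sequence from this record (10 bases only).
first_block = "ATGGCGCTGG"
print(gc_percent(first_block))  # 70.0
```

To check the reported value, the full gene sequence string would be passed in place of first_block.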