Gene Information |
Locus tag | Arth_4337 |
Symbol | |
ID | 4443483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008538 |
Strand | + |
Start bp | 76144 |
End bp | 77301 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639687658 |
Product | hypothetical protein |
Protein accession | YP_829355 |
Protein GI | 116662301 |
COG category | [R] General function prediction only |
COG ID | [COG4469] Competence protein |
TIGRFAM ID | |
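The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based and inclusive and that the gene span includes the stop codon (the gene sequence in this record ends in TGA, which is consistent with that assumption).

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that includes the stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the final codon is the stop and is not translated
    return gene_len, protein_len


# Values taken from the record: Start bp 76144, End bp 77301
gene_len, protein_len = cds_lengths(76144, 77301)
print(gene_len, protein_len)  # 1158 385
```

Both results agree with the Gene Length (1158 bp) and Protein Length (385 aa) listed in the record.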
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.478292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAGCG AATCAACGTC GGAGCGCCGG GATATCTACG CGGTCCTTGG TTCCCCTGAC TCGCGCTTTC CCGTCGAAGC GCCGGCAGAT CCGAAGGAAG CCCGCCGGCT GAAGGGCGGA AACAGTTTCT ACTGCTCGAC CGCACTTGGC GGGTGCGGAG GGGAACTGAC CTTTGCCATC GGCGACGTCA ACATTCCCCA CTTCAGGCAC CAGGCGGGAA GCAGATGTTC ACTGATCTCG TCCAAGACCC TGGCCGACCG CTACACCCAC CTTGCCATCC AAGAAGCATT GCGGGCGTGG ATCGAAACCA TGCCGGGCTT CTCCTGCCGC CTTGAGGTCT CCATAGAAAG CGGACGCACG GACGTCCTGG TGACAGGTCC GTCCTTCGAG GTTGCCCTCG AAGTGCAGCG TTCAGCGCTG TCTGCCCGCA ACGCCCTGGA ACGGACGGCG GTCTACAGCC ATAGAGCCAA CGCAGTGCAA TGGCTGTACG CATTCAGGGA CATCGATGCC TACAAGGCGG AGCTTGCCGA CCGCGGATGG AGCCTGAGGA TCTGGTACGG GTGGGCCAAG AAGGAATGCA GGATCGGCGT CAGCTACGAA ACCGAAACCG GCGCGGAGGT GGAAATAAAG GAAGCCGGCG GACCGCTGAC AGACTGGGAC ATATCCTTTC GTGGTTTGGA CTCGGTCCAC CTCCGCAAGG CAAAAGCGGC AGTGGAGCGC CTGAGAGCGA CGGAGCGGGA GCGACGCCTG GAGCAGGCCA GGGAGGAGGC TGCCCGCGAA GCCGCGGAAA AAGCGCGTAA AGAGGCCGCG CGGCTCCGTC ACATCGCAGA CCAGAGAGCA GCGCACGAGT CTCTTCTCAG GGCCTTGCAG CACACCCCGG AGGGGCTGGA AAACAAATGG CCATCGTCGT GGCCCCAGCT TAAAGGCAGC CCGGGACAGG TCTCATGGGC AGAATCGATT CGTGCCCGGG CCGTCGCTTT GTTGCGTGAA GAATTGGTCG AGGAGTGGCT TCCCCAAGCC AGGGGAGTAC CGGTTGCGCG GTGGCTGGCT TTACAATCAT CGGCGGCATT CTGGATTCAT TGCCGCTTCA ATGACACCTT TGCTTTTGTT CAAGCGTACG AGCACCAATT CGGATCCCCG TGGCACCCCC AACGTTGA
Protein sequence | MDSESTSERR DIYAVLGSPD SRFPVEAPAD PKEARRLKGG NSFYCSTALG GCGGELTFAI GDVNIPHFRH QAGSRCSLIS SKTLADRYTH LAIQEALRAW IETMPGFSCR LEVSIESGRT DVLVTGPSFE VALEVQRSAL SARNALERTA VYSHRANAVQ WLYAFRDIDA YKAELADRGW SLRIWYGWAK KECRIGVSYE TETGAEVEIK EAGGPLTDWD ISFRGLDSVH LRKAKAAVER LRATERERRL EQAREEAARE AAEKARKEAA RLRHIADQRA AHESLLRALQ HTPEGLENKW PSSWPQLKGS PGQVSWAESI RARAVALLRE ELVEEWLPQA RGVPVARWLA LQSSAAFWIH CRFNDTFAFV QAYEHQFGSP WHPQR