Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2228 |
Symbol | |
ID | 4445289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2506390 |
End bp | 2507955 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639690037 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_831708 |
Protein GI | 116670775 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000756142 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGCGAC GCCGAGACAC TCCCGGACAA GGCATTGGTT CGCCAGCGTC CGACGCCGCT GCCAGGCATT CGGACACGAA GGACATGGGC CCCGCCCGGC ACCTCGGCGC CATGGGCGGA AAGCCCGCGT GGTTCAAGGT GGCCACTGCT GTTGTGGCCC TTGTCCTGGT GGGCGCTCTC GCTTTCGCCG CCTTCTGGGT CATCCGCTTG CAGATGAACA TCAGCAAAGC TCCACTCGGC GCAGGCAGCA GCCGCACCGA AGATCCCGTC AACGACTCCA AGGACCGGAT GCAGATCCTG ATCCTGGGCT CGGACACCCG CGACGGCAAG AATTCCGACT ACGGCACGGC CGAGGACTCC ACAGGCTACG GCCAGTCGGA CGTCATGATG ATGATGGACA TCTCGGCGGA CAACAAGCGT GTCAGCGTCA TCAGCTTCCC GCGCGACCTG CTCGTGGACA TTCCCGAATG CACGGACCAG AAGACCAAGC AGGTGTTCCC GGCCCGAAGC GGCGTGATGA TCAACGAAGC CATGAAAGAG GCCGGCATCG GCTGCGCCGT GGACACGGTG AACAAAATCA CGGGGTTGGA AATCGACCAC TTCATGATGG CGGACTTCAA CGCGGTCAAG GAACTTTCCA ACGCGGTGGG CGGCGTGGAA GTCTGCGTAA GTGACGCCGT CTACGACCCC GACTCCCGCC TGCGCCTCCC CGCAGGAAAC TCGCAGGTGC AGGGCGAGCA GGCGCTGGCC TACCTGCGGA CCAGGCATGC CTTCGCGGAC GGCGGTGACC TGGGCCGCAT CAAGGCGCAG CAGGGCTTCC TGTCATCCCT CACCCGCAAG ATCAAGGATG ACGGCACACT GTCCGACCCC CAGAAGATGC TCAAGATTGC CGACGTCGTC ACGCAGAACC TTACGGTGGA TGATGGACTG GCGTCCGTCC CGTCGCTGCT GACCATCGGC AACCGGCTCA AGAACATTGA CATCAGCAAG GTGGCGTTCG TTGCCGTGCC AACCACGCCT GCTCCCACTG ATCCCAACCG GCTCACCGTT GCCGAGCCGG CCGCATCGCA GCTTTTCGCC GCGCTGCGCA AGGACGTCGA CCTGACCGAC CCGACAGCCA CCCCGAGCCC CACGGCGGAG CCGAGCGAAT CGGCTCCCGC CCCGACGCCG ACCGAAACGC CGCTGCCGCC CTACGATAAG GCGCTGCAGC CGGTGACCGT CGCGAACGGA ACGGGTGTTC CGGCGCGGAC CCAGGAGATC ACCCAGGCGA TCATCGCCGG CGGCTTCACC CAGGTGGCCC CGCTTGTGGC GCAGCCTGTC GCGAAGACGG CGGTCTACTA CGGACCCGGC TTCGAGGACG TGGCGGCGGA CGTCGCAGCA TTGCTGGAAA TACCCGCCAC GCAGGTTCTC CCGGCGGCCG GCGTCAGCGG AGTTCAGGTC TACCTCGGCA CCGACTTCAT GTCCGGAACA AAGATGGACT CCGTGCCGCT CCCGTCCGAC ATTGTCAACC AAACGGCCGG CGACACCGTC TGCCAGCAGG CGAACCCTGA ACTGATCGTC CGCTAG
|
Protein sequence | MVRRRDTPGQ GIGSPASDAA ARHSDTKDMG PARHLGAMGG KPAWFKVATA VVALVLVGAL AFAAFWVIRL QMNISKAPLG AGSSRTEDPV NDSKDRMQIL ILGSDTRDGK NSDYGTAEDS TGYGQSDVMM MMDISADNKR VSVISFPRDL LVDIPECTDQ KTKQVFPARS GVMINEAMKE AGIGCAVDTV NKITGLEIDH FMMADFNAVK ELSNAVGGVE VCVSDAVYDP DSRLRLPAGN SQVQGEQALA YLRTRHAFAD GGDLGRIKAQ QGFLSSLTRK IKDDGTLSDP QKMLKIADVV TQNLTVDDGL ASVPSLLTIG NRLKNIDISK VAFVAVPTTP APTDPNRLTV AEPAASQLFA ALRKDVDLTD PTATPSPTAE PSESAPAPTP TETPLPPYDK ALQPVTVANG TGVPARTQEI TQAIIAGGFT QVAPLVAQPV AKTAVYYGPG FEDVAADVAA LLEIPATQVL PAAGVSGVQV YLGTDFMSGT KMDSVPLPSD IVNQTAGDTV CQQANPELIV R
|
| |