Gene Information |
Locus tag | Arth_1703 |
Symbol | |
ID | 4445776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 1902065 |
End bp | 1903990 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639689525 |
Product | hypothetical protein |
Protein accession | YP_831197 |
Protein GI | 116670264 |
COG category | [S] Function unknown |
COG ID | [COG4289] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
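The coordinates and lengths listed in the Gene Information table agree with each other, and the arithmetic can be checked directly. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length(1902065, 1903990)
print(length)                  # 1926
print(protein_length(length))  # 641
```

Both results match the Gene Length (1926 bp) and Protein Length (641 aa) rows. The strand does not affect this calculation, because the coordinates describe the same interval on either strand.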
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.57569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCGCC ACGAGCACCT GATGCCAACA GCCACTTCGA CTGAGGGATC ACTAACGGGC TGGGACCGCG CATCGTGGGA AGACCTTGCA GATAAGCTCA TCGCAGGGGC CCTGACCCAC TCCTCACCCC GAGGCGCCCG GGTGGCATTT CCTTCAGCGA CGGGGCAGAA CGACACTGCC CAGCTCGAGG GATTCGCAAG ATCCTTTCTG CTTGCATCAC TCCGGATCGC AGGATCCCCC GGTGCGCGGC CTGACCTCAT CGGGTGGTAC GCGGATGCGC TGGCTGCAGG GACACAGGCC GGCGGTGTCG AAGCCTGGCC CGTTTTGCGT GACCACGGGC AGCCCACCGT TGAAGCGACA GCGATCGCTC TTGCCCTTCA CTTGTCCAGA GCATGGCTCT GGGAATCATT GGACGACGGT GTCAAAAATC GGGTTGTGCT CTGGCTCTCC GGATCGAGGG GCAGGTACGG AGCAGACAAC AACCATGTGT TGTTCGGGGC AACCATCCAG GCTTTTCTCG CCTCAGTGGG CGCGCCCTTT GATGTTCTCG AAATTGAGGG GGCATTGGAC CGGATTGAGG ACTGGTACGC CGGGGACGGC TGGTATTCCG ACGGGGAGGG CCGCCGCTTC GATCATTACA ACGCCTGGAC TTTTCACTTA TACCCCTTCT TCATCGTGGA CATGCTTTCC GGGACCGCAG GACAGGCAAC CGGATCGTCC GAACGCCTCC GACTGTATAG AAGCCGTCTG CGGGAGTTTC TTCGCGGGTA TGAACACTTG TTCTCCGCCG CAGGCAGTCC CCTGATTCAG GGGCGGTCCC TCATTTACCG TTGGGGTGTC GTTGCGCCGT TCTGGATGGG CGAAATCCAG GGCGTCTCCC CTCTGCCTGC CGGCCGGACC CGGCGGCTCG CCTCCGGCGT AGCCAAGCAT TTTGTGGACC ATGGTGTAGG CGCTGACGGT GTCCTGTCGT TGGGCTGGTG GAAAGAGAAC ACCGGGATTC TTCAGTCTTA CAACGCCCCG GGATCACCGT TGTGGTCAAG CAAAGGGTTC CTGGGCCTGC TCCTTCCGGT TGAACACCCG GCCTGGTCCC ACGACGAGAC CGGGCTGGAG ATCGAACGCC GGGACGTCAG GGAAGTGCTC TCCGGTCCGC AGTGGCTGGT CCACGCCACC CAGCGGGACG ATATAGTTCG CGTCGCCAAT TTCGGCTCGG GAGGGCATCC GCGCTACGAC TCCCACCTGT ACAGACGTTT AGCGTTCAGT ACGGCCACTG CGCCGGTCCA GTACGGGGAC CTTAGGGACA ACGACATCTA TATTCCCCTG CAGGACGCCA CGAGCACCCA CCGTGGCCCG CTGGGAGGAG TCGCCCGCCC GCATGGAGGA AGCCTTCGGT TCCTCCTGGA TGCTGCCGGA CGAGGAGTGT CTGTCGATTA TGCCACCACC ATCCTTGATG GTTCTGTCGA GCTGAGAGCG GCAAGAGTCC GCGGCGCAGT TGCGCTGCCG CTCACTGTCA GCGGATACGC GCTCTCCTCA GATCAGCCGA TGGACACTGG TGTCCGGGGC GGCTGCGCCC GGGCCACAAC CCACGACGGA CTCTCTTCGT CCATTGCCCT GGTCAGTATC CAGGCGGACG CCAGTTCCGC CCCTGCTGCG CAGATCCGGT ATGCACCGGA GTCAATCCTG GGTGAAAAGG TGGCTGTTCC GGCGGTCAGA ATCTCACCAG CAAGAAGTAC CGAGATACGC ATCGCCTGGC TTGTCTCCCT CTCACGTCGG GAGGTGGACC TCTCAGCCAT TGCCGCCGGA 
TTGGAGCTTA ACTGGTCCAA CCAGTGCCTG CACGCGTCGA TTGGAAACAC CGCACGGTCT CTGCCGTGGA TGCGGCACGA TAAGTGGCCG GCGGACAGCA TCAATCAGGG GATCTTCGGA GGATAG
|
Protein sequence | MTRHEHLMPT ATSTEGSLTG WDRASWEDLA DKLIAGALTH SSPRGARVAF PSATGQNDTA QLEGFARSFL LASLRIAGSP GARPDLIGWY ADALAAGTQA GGVEAWPVLR DHGQPTVEAT AIALALHLSR AWLWESLDDG VKNRVVLWLS GSRGRYGADN NHVLFGATIQ AFLASVGAPF DVLEIEGALD RIEDWYAGDG WYSDGEGRRF DHYNAWTFHL YPFFIVDMLS GTAGQATGSS ERLRLYRSRL REFLRGYEHL FSAAGSPLIQ GRSLIYRWGV VAPFWMGEIQ GVSPLPAGRT RRLASGVAKH FVDHGVGADG VLSLGWWKEN TGILQSYNAP GSPLWSSKGF LGLLLPVEHP AWSHDETGLE IERRDVREVL SGPQWLVHAT QRDDIVRVAN FGSGGHPRYD SHLYRRLAFS TATAPVQYGD LRDNDIYIPL QDATSTHRGP LGGVARPHGG SLRFLLDAAG RGVSVDYATT ILDGSVELRA ARVRGAVALP LTVSGYALSS DQPMDTGVRG GCARATTHDG LSSSIALVSI QADASSAPAA QIRYAPESIL GEKVAVPAVR ISPARSTEIR IAWLVSLSRR EVDLSAIAAG LELNWSNQCL HASIGNTARS LPWMRHDKWP ADSINQGIFG G
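The relationship between the gene sequence and the protein sequence can be illustrated by translating the two ends of the gene. This is a minimal sketch, assuming the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG) and using the amino acid assignments of translation table 11, which are the same as those of the standard table. The `translate` helper is hypothetical. Only the first 30 and the last 15 nucleotides are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same amino acid assignments as the standard table; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence (10 codons).
print(translate("ATGACCCGCC ACGAGCACCT GATGCCAACA"))  # MTRHEHLMPT

# Last 15 nt of the gene sequence (4 codons plus the TAG stop codon).
print(translate("ATCTTCGGA GGATAG"))  # IFGG
```

The first result matches the opening block of the protein sequence and the second matches its last four residues, with the terminal TAG read as the stop codon.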