Gene Information |
Locus tag | Arth_2749 |
Symbol | |
ID | 4444598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3091817 |
End bp | 3094837 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639690571 |
Product | hypothetical protein |
Protein accession | YP_832228 |
Protein GI | 116671295 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
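The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive replicon coordinates and that the gene length includes the stop codon; both are assumptions about this record's conventions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3091817
END_BP = 3094837


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3021
print(protein_length(length_bp))  # 1006
```

Under these assumptions the results agree with the Gene Length (3021 bp) and Protein Length (1006 aa) rows.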
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.179531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTCCCGTC CCGCCAGCTC CACTCCGCCC GGAAGACCCC AGCCAAGGCG AGGTGCCTTG ACGCCGACGT TGATCGTCGT AGCACTGGTT GTGGTCGGAT TCATCTTCTT CGCCAATGTC TGGACCGATG TCCTCTGGTA CCAGCAGCTC GGGTTCTTTG AAGTATTCCT CACGGAGAAC CTGGCCCGGA TCATCATCTT CCTTGCCGGC TTCGCGCTGA TGTTCGTGGC CATGTTCTAT GCCATTCGGA TCGCGTACCA CGCCCGTCCC GTCTACGCGC CGGACTCGGA GATCAGGGAC AACCTGAACC GCTACCAGGC TCAACTGGAA CCCGTCCGCC GGGTGGTCAT GATCGGTCTG CCGGTGCTGT TCGGCCTCTT TGCCGGAAGC GCGGCCGCCA GCCAGTGGCA GAAGGTGCTG CTGTTCCTGA ACCAGGAGCC GTTCGGCCAG AACGATCCGC AGTTCAACCT GGACATCAGC TTCTACCTGA TGACCCTGCC GTTCCTCGGC TTCGTGACCG GCTTCCTCAT CAGCGTCGTT GTGGTCGCTG GTATCGCGGG AATCCTGACG CACTATCTCT ACGGCAGCAT CCGGATCATG GAACGCGGCA TCTTCACCAG CCGTGCCGCG CAAATCCACC TCGCCGTCAC CGGTGCGGTC TTCCTGCTTC TGCTTGGCGT GAACTTCTGG CTGGACCGCT ATTCCTCAGT TCAGAACAGC AACGGACGCT GGGCCGGCGC CCTTTACACG GACGTCAACG CCGTCATCCC CACCAAATCG ATCCTGGCTG TAGCCGCCGC GCTGGTGGCA ATCCTGTTCA TCGTCGCCGC AGTGATCGGC AAATGGCGAC TGCCCGTCAT CGGCACGGCA ATGCTGGTCA TCACCTCCAT CCTCGCCGGC GGTGTCTACC CGTGGGTCAT CCAGCAGTTC CAGGTGCGCC CGTCGGAACA GACCCTCGAG AGGCAGTTCA TCGAGCGGAA CATCAGCATG ACCCGCGCCG CCTACGGCCT GGATAAGATC CAGGAGAAGC GGTACAACGC CACCACTAAC GCCACCACAG GAGCACTGGC ACCGGACGCG CAGACCACTG CCAATATCCG CCTCCTGGAC CCGAACCTGA TTTCGGACGC CTTCTCCCAG CTTGAGCAGT ACCGTCCCTA CTACCAGTTC CCGAGCGCGC TCAATGTGGA CCGGTATGAA GTTGACGGCA AGGTGCAGGA CACTGTGATT GCTGTCCGCG AGCTGAACCC GGACGGCCTC AGCGCCAACC AGCAGTCCTG GCTGAACCGG CACGTGGTCT ACACCCACGG TTACGGCGTA GTGGCCGCTA AGGGCAACAA GTTCACCGCC GACGGCAAGC CTGAGTTCCT GCAGGCCGGC ATTCCATCCA CCGGCGTGCT CGGCAACGAT TCGACGTACC AGCCCCGGAT CTACTTCGGC GAAAACTCGC CCGAGTACTC GATCGTAGGG GCACCCGAGG GTTCGCCGCA CCGTGAGCAG GACCGTCCCG CCGGCAAGGA AGGCGATGGC GAAACCCAGT ACACCTTCAC CGGCAACGGC GGCCCGAACG TAGGCAGCTT CTTCAACAAG GTCCTCTACG CGATCAAGTT CCAATCGTCC GACCTGCTGC TGTCCGACGG CGTCAACGCC GAGTCGCAGA TCCTCTACGA CCGCAACCCG CGGGACCGCG TCGAAAAGGT GGCCCCCTAC CTCACGGTCG ACGGCAACGC CTACCCGGCG GTGGTGGACG GCCGCGTGAA GTGGATCGTG GACGGCTACA CCACCAGCCA GTACTACCCG 
TACTCGCAGC AGGAGCAACT GTCCGCAGCC ACCGCTGATT CGCAGACCAC GGCCGGGCGC ACGGTCGCGT TGCCGAATAG CTCGGTGAAC TACATCCGCA ACTCCGTGAA GGCAACGGTT GACGCCTACG ACGGCTCGGT GACGCTTTAC GCCTGGGACG ATCAGGACCC GGTGCTGAAG GCATGGCAGA ACGTCTTCCC GACATCCCTG AAGCCCTATT CGGAGATGTC CGGCGCGCTC ATGAGTCACG TCCGCTACCC CGAGGACCTG TTCAAGGTCC AGCGCGAACT GCTGGGCCGC TACCACGTCA CGCAGCCGGA CAACTTCTAC ACGAACAACG ATGCCTGGTC CGTGCCGAAC GATCCCACGG TCAAGGAAGA GGTCAAGCAG CCGCCGTTCT ACATGTCACT GCAGATGCCG GACCAGGACA AGCCCGCCTT CCAGCTCACG TCGTCGTTCA TTCCGCAGGT GGTCAACGGC ACCGCTCGCA ACGTGCTCTA CGGCTTCCTG GCCGCGGACT CCGATGCCGG CAACCAGAAG GGCGTGAAGG CGGAAAGCTA CGGCCAGCTA CGGCTGCTGC AGATTCCTCC GGAAGCTCAG GTCCCGGGCC CGGGCCAGGC CCAGAACAAG TTCAACTCCG ATCCCACAGT GTCCCAGGCG TTGAACCTGC TCCGGCAAGG CGCGTCGGCC GTCCTCAACG GCAACCTGCT GACCCTCCCG GTGGGCGGCG GTTTGCTGTA CGTGCAGCCT GTCTACCTCC GCTCCACGGG CGAAACGTCC TACCCCACAC TGCAGCGCGT GCTGGTTGCC TTCGGTGACA AGATCGGGTT CGCGCCGACA CTGGATGAAG CGCTGAACCA ACTCTTCGGC GGCCAGTCGG GCGCCAAGGC CGGTGACTTT GCCAATAACG GCCAGACACC GCCGCCCGCA GCCGGAGGAA GCACTCCGCC GGCCACCGGT GGTACGGACG CCAAGGCGGA ACTGAAAGCC GCACTGGATG AGGCGAACGC AGCCATCCGT GCGGGCCAGG AGGCTCTGGC CAAGGGGGAC TTCGCCGCCT ACGGCGAGCA GCAGAAGAAG CTGTCCGCCG CCCTCCAGAA GGCGATCGAT GCCGAAGCGA AGCTCGGTTC GGAAGGTGCC TCGCCGACGC CGGGAGCCAC CACGGCTCCC ACAGCGACCC CGTCGGCCGC CGCGACGCCG TCGCCCTCTC CGAGTAACTG A
|
Protein sequence | MSRPASSTPP GRPQPRRGAL TPTLIVVALV VVGFIFFANV WTDVLWYQQL GFFEVFLTEN LARIIIFLAG FALMFVAMFY AIRIAYHARP VYAPDSEIRD NLNRYQAQLE PVRRVVMIGL PVLFGLFAGS AAASQWQKVL LFLNQEPFGQ NDPQFNLDIS FYLMTLPFLG FVTGFLISVV VVAGIAGILT HYLYGSIRIM ERGIFTSRAA QIHLAVTGAV FLLLLGVNFW LDRYSSVQNS NGRWAGALYT DVNAVIPTKS ILAVAAALVA ILFIVAAVIG KWRLPVIGTA MLVITSILAG GVYPWVIQQF QVRPSEQTLE RQFIERNISM TRAAYGLDKI QEKRYNATTN ATTGALAPDA QTTANIRLLD PNLISDAFSQ LEQYRPYYQF PSALNVDRYE VDGKVQDTVI AVRELNPDGL SANQQSWLNR HVVYTHGYGV VAAKGNKFTA DGKPEFLQAG IPSTGVLGND STYQPRIYFG ENSPEYSIVG APEGSPHREQ DRPAGKEGDG ETQYTFTGNG GPNVGSFFNK VLYAIKFQSS DLLLSDGVNA ESQILYDRNP RDRVEKVAPY LTVDGNAYPA VVDGRVKWIV DGYTTSQYYP YSQQEQLSAA TADSQTTAGR TVALPNSSVN YIRNSVKATV DAYDGSVTLY AWDDQDPVLK AWQNVFPTSL KPYSEMSGAL MSHVRYPEDL FKVQRELLGR YHVTQPDNFY TNNDAWSVPN DPTVKEEVKQ PPFYMSLQMP DQDKPAFQLT SSFIPQVVNG TARNVLYGFL AADSDAGNQK GVKAESYGQL RLLQIPPEAQ VPGPGQAQNK FNSDPTVSQA LNLLRQGASA VLNGNLLTLP VGGGLLYVQP VYLRSTGETS YPTLQRVLVA FGDKIGFAPT LDEALNQLFG GQSGAKAGDF ANNGQTPPPA AGGSTPPATG GTDAKAELKA ALDEANAAIR AGQEALAKGD FAAYGEQQKK LSAALQKAID AEAKLGSEGA SPTPGATTAP TATPSAAATP SPSPSN
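The gene sequence begins with TTG rather than ATG, yet the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where TTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step in plain Python, not the pipeline that produced this record. The helper name `translate_cds` is hypothetical, and the start-codon set is the one listed for table 11 as best recalled; it should be checked against the NCBI genetic code tables before being relied on.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in which codons may initiate.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5' to 3' on the coding strand.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    The first codon must be an accepted start codon and is emitted as 'M';
    translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not an accepted start codon")
    protein = ["M"]
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGTCCCGTC CCGCCAGCTC CACTCCGCCC"))  # MSRPASSTPP
```

The ten residues produced match the start of the protein sequence listed above. Because the record gives the strand as "-", the gene sequence shown is presumably already the reverse complement of the replicon's forward strand, so no further strand handling is done here.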