Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0468 |
Symbol | |
ID | 4447058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 496333 |
End bp | 499605 |
Gene Length | 3273 bp |
Protein Length | 1090 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639688266 |
Product | YD repeat-containing protein |
Protein accession | YP_829968 |
Protein GI | 116669035 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTGT CCTTTGTTGT TGTGACATCA GTGGCTTTGA CAGGGATGGG CGCACCTGCT GTGGCCGCCC TTGACGCGTC AGTACTGACC GCCCCGGCGG CTCCATCACC TCCGTCCAGT CCAGTAACGG AGGCGAAACT GGAACCGGCG CCTGAGCCTG TCCAGGTCGA GCCAGAACCG GGGGATGAAG TACCTTCCTC TGATGCTGAA GGTACGCCCA TCATCCCGGA GGAAGCGCCG GCGGCTTTGG CTGTCGCCGC CTTGGCCCCG GCAACCGGAG CGAATGGCGT CGCGTCAGGC AACCGTGCCG CTGCGTCCAT GGTCACCCGC CAGCTTTCGG ATGCCGCCGC ACTTTCCTGG AATCCGACGA ACGGGAACAT CGTGCTGACC GGGGATCTGC TGCATCTGAA AGGCGTCACC CGGGACATGG ACGTGAAATG GCGCTACAAC AGCCTCAATG ACGTCCGCCC AACGCTTTCG GTCGGTACCC GTGAAGCTGG CGTCAGGGTC GATGCGTCCA ACAACATCAC GTACACCGCA GCGGACGGCG GGGAGTACAC CTTCGTTCCC GACGGAGCCG GAGGATGGAC CCGGCCCGCA GGCCTGAATG CCTCTATCCG GTCGCTTTCC GCAACCAACC TCAACATCAA TTTCGATGAC ACCGGATATG GCAATGGATA CGTCAAGAGC GGGGACGTGT ACGTCCTCCG AAGCGAGGAC GACCACTACG GAGACCTTCC CAACCGGGCC TCGTACTACG ACACCGACGG CCGACTCAGT GCCAGCTCGG ATAACCGGAA CCAGATCATC AACTACGTCT ACCAGGACGC GAATAACGAG GAACAGCCCT CCCAGATCAT CGACACCACG ATCAACCGGG CGATCAACAT CGAGTACAAC GGCGGATCCG GACGGATGTC CAAGATCACC AACGCCTCCG GCGCGGCCCT GTCCTTCACC TACAACTCCG CCGGCAAGAT TGCCACCGTC AAAGACGGCC GCGGCACCAC CACCACCCTG GAGTACGACA CCAATGGCAG GGCAAAGAAA ATAACCTACG GCACCGGCAC CACCGCCCAA AGCATCTTCA CCCCGGCTTA CCCCACCGCG ACATCCTCGA CGCTGACGGA CCCGAACAAT AAAACAGCGA CCTACGCTTT CACAGGAGCC CGGCAGGTCA CCCAGGTCAC CGATCCGAAC GGGAACACCA CCCAAGCCAC CTGGGACGCC CACGACAACC GCCTGACCTC AGTGGACGGG CTGGCCAAGA CAACCACCGC CACGTACAAC CTCAATAACT CCCTGACCAA GATCACCAAG CCCGCAGCCG GGACCGGGAC AGGAAGCGAA ACGATCTACA CCTATGCCGG CCCGAACGGC CGCCTACCGA TCGCGTCCAC AAACAGCGAA GGACACGTCA CCACCTATGG CTACGACGTG AACACTGCCA ACATCGACAA GGCACAGACA CCAGGGCCCA ACGGCGGCGC AGGCGGCGCC CGAACCTGGA ACTTTGAGCG CGATGACGAA CTAACCACCT GCGGAGCTCA GCGCGGTCAG CTGTGCAAAA CCACGGACGG CAACGGCAAC GTCACCAGCT ACGCCTACGA CGCAAACCGC AACCCCGTCA CGATCACCCG CCCAGCACCC CTTGGTGTCA TCACCAACAC CTTCGACGCC GCAGACCGGC TGATCACGTC CAAAGATGGC AAGAACCAGA CCCAGACCCA CACCTACGAC AACAATGACC GGATCACCCA GACCCGCCAG GGCGCCACCT GCATCCCCGC GACCTGCGTC ACCTACACCT ACGACGCGAA CGGCAACCTC ACCCAGCGCG TCGACGCAGG GGGCACCACC ACCATCACTT ACGACGCCCA GAATAGGCCA ACGACTAAGA CCATCGGCGG CACCACCACG ACGCTCACCT ACGACGGGGC CTCCAACATC CTGACCAGTG TGGACCCGCT CGGGACCGTG ACCTATAAAT ACGACGCCGG GAACCGGCTG ATCTCCCTCG CCGAGCCCGG CGGATCCTGC CCCGCAACCC CGGTATTTCC GAACAGCACC AAATGCACGG GCTTCGAATA CGACGCCAAC AACAACCGCA CCGCCACGAA ATACCCCAAC GGGATGAAGA ACACCACGGT CATCGACGCG GCCGGGCGGA CCACCTCCAT CACCGCGACC AACACCACCG CGGGCGTCCT GGCCAGACGC GCCTACACCT ACACCGTCAA CGGCACCAAA GACGGTGCCC TGCGCAAAAC CGTCACCGAC CATGCAGGCA CCGTCACCAC CTACAACTAC GACGAGGTCA ACCGGCTCAC CCAGGCCGTG ACCGGGACCA ACACCGAAAC CTGGGCCTAC GACAAGAACA GCAACCGGAC CGTCGATACC AAGACCGGGA CCGCGAACGT GTACAACGCC TACAACGGAG CCGACCAGCT CTGCTGGGTA GGGGCCAGCG CCGGGACCTG CGCCTCACCC CCGGCAGGGG CCGTCACCTA CGCCTACGAC GCCAACGGCA ACACCACCAC CGCCGGGGCA ACCACCCAGA CCTACAACGT CTTTGACCAG TTCACCTCCA ACACCAACGG CGGCACCACC AACTACGCCT ACGCAGGCAC CCGCAACGAC GAACGCACCA CCGCCGACGG AACCGCGTTC CTCAACGGTG CACTGGGCAT CACCCGCCAA ACCACCGGCG GGGCAGCCAC GTCCTTCATC CGTGACCCGG ACGGTAACCT CGTCAGCATG CGCACCAGCA CCGGAGCCAG CTACTACTAC ACCACCGACG CCCTCGGCTC GGTCATCCTC CTCACTGACA GCGCCCAGAC GAAAGCCGCC GAATACGCCT ACGACTCCTG GGGACTGACC ACCACCAACA GCGGCGCCCA GGCAGCTGTG AACCCGTGGA CCTACGCCGG CGGGTACAAC GACACCACCA GCAACCGCAT CAAATTCGGC GCCCGCTACT ACAACCCCTG GCGCGGACGC TTCACCCAAC CCGACCCATC AGGCCAAGAC CAAAACCGTT ACGCCTACGT AAGCTGCAAC CCAATCAACG CCACAGACCC AACGGGCCTC GGCCCTGCCG AATGCTTTTT CAACGCTTCG GCCGCTGTAT TTTCCGCTTT CGTTGTTGCG GGGGCAGGCG CCGCCGCTGT CGCAACTGGC GGTTTGGCAG TCGTTGGCTT GATCGGGGCG ATTGGACTTG AGGCTACTAC TGCCACTGCC GCTGGCTACT ACTGTGGAGA GCTGATCAAA TGA
|
Protein sequence | MALSFVVVTS VALTGMGAPA VAALDASVLT APAAPSPPSS PVTEAKLEPA PEPVQVEPEP GDEVPSSDAE GTPIIPEEAP AALAVAALAP ATGANGVASG NRAAASMVTR QLSDAAALSW NPTNGNIVLT GDLLHLKGVT RDMDVKWRYN SLNDVRPTLS VGTREAGVRV DASNNITYTA ADGGEYTFVP DGAGGWTRPA GLNASIRSLS ATNLNINFDD TGYGNGYVKS GDVYVLRSED DHYGDLPNRA SYYDTDGRLS ASSDNRNQII NYVYQDANNE EQPSQIIDTT INRAINIEYN GGSGRMSKIT NASGAALSFT YNSAGKIATV KDGRGTTTTL EYDTNGRAKK ITYGTGTTAQ SIFTPAYPTA TSSTLTDPNN KTATYAFTGA RQVTQVTDPN GNTTQATWDA HDNRLTSVDG LAKTTTATYN LNNSLTKITK PAAGTGTGSE TIYTYAGPNG RLPIASTNSE GHVTTYGYDV NTANIDKAQT PGPNGGAGGA RTWNFERDDE LTTCGAQRGQ LCKTTDGNGN VTSYAYDANR NPVTITRPAP LGVITNTFDA ADRLITSKDG KNQTQTHTYD NNDRITQTRQ GATCIPATCV TYTYDANGNL TQRVDAGGTT TITYDAQNRP TTKTIGGTTT TLTYDGASNI LTSVDPLGTV TYKYDAGNRL ISLAEPGGSC PATPVFPNST KCTGFEYDAN NNRTATKYPN GMKNTTVIDA AGRTTSITAT NTTAGVLARR AYTYTVNGTK DGALRKTVTD HAGTVTTYNY DEVNRLTQAV TGTNTETWAY DKNSNRTVDT KTGTANVYNA YNGADQLCWV GASAGTCASP PAGAVTYAYD ANGNTTTAGA TTQTYNVFDQ FTSNTNGGTT NYAYAGTRND ERTTADGTAF LNGALGITRQ TTGGAATSFI RDPDGNLVSM RTSTGASYYY TTDALGSVIL LTDSAQTKAA EYAYDSWGLT TTNSGAQAAV NPWTYAGGYN DTTSNRIKFG ARYYNPWRGR FTQPDPSGQD QNRYAYVSCN PINATDPTGL GPAECFFNAS AAVFSAFVVA GAGAAAVATG GLAVVGLIGA IGLEATTATA AGYYCGELIK
|
| |