Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3221 |
Symbol | |
ID | 4444211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3627591 |
End bp | 3629180 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691045 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_832697 |
Protein GI | 116671764 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3944] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.929706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTCA ACATTGCGGC CGGCGCAGAA GCACCTGCCG GCCTGGACCT GGCAGACTAC CTTCGGGTGG TGCGGGTGTA CTGGAAGGCC ATCGTCGCCT TCACCCTGCT GGCCACATTG ACGGCGTTCG GCTGGACCGT CCTTCAGCCC AAGATCTACT CGTCCGATTC CAGCGGCATT GTGGTCACAC CGGGCTCGGA CAACGTGAGC CTTTCACTGG CCGGGGACAG CCTGGCCAAG GCAAAGGTCA AGAACTACGA GTCCGTGGCC AAGTCCCGCC TCGTGGCTGA CCGGGTCATC GCCTCCCTGG AACTTAAAAC AACCGCAGAC GCGCTACTCG GCACCATCAG CGTCAAGGTT CCGCTGGATA CCGCCGAGAT CCGGGTGACG GCCCAGTCGC CCGATCCGGC AACCGCCCAG CGCGTGGCCG ATGCCTGGGT CAACGGCCTC GCCGCCCAGG TGGAAGCGAT CGAAACAGCC ACACCCGGAA CAGCAACGCC CGACGCCGGC ACCTCGCCGG ACGCTGCCAC CGCACCAGCG GCGACGGCCT CCGCCGTCAG GATCCTCCCG CTGGGCAAAG CGGTCCTTCC CACCAGCCCG GTGTCCCCGA ACGTCAAGCT CACCCTGGCG CTCGGTGCAC TCATCGGTCT GGCCCTCGGC GTGGCCTATG CCCTGGTCCG CCGGCACCTG GACCGGCGCA TCCGCAACGC CACCGAAATC GAACGGCTCT TCGACGTCCC GGTGATCGGC ACCCTCCCCG TGGACCACCG CCTGGACGAG AAAAGCACCA TCCTGGACGC TGGAGCCGCA GCCCAGCTGC ACGACGCCGG CGGGGCGATG GCCGAGGCCC TCCGCGAACT GCGCACCAAC CTCAGCTTCC TGGACGTGGA CCAGCCGCCG CGGATCATCG TGGTCACCAG CTCCATGCAG GCCGAAGGAA AGTCCACCGT CACCGCCAAC CTGGCGGTCA CCATGGCGGC CGCCGGCGAG AACGTCGTAG TGGTCGACGG CGACCTCCGC CGCCCCACGC TGGTGGACGT TTTCAACCTG GTTCCGGGAG TCGGGGTTAC CGACGTGCTC ACCGGCACCG CTGAACTGGA GGATGTCCTC CAGCCCTGGG GTGCCCTGCC GAACCTCTCG GTCCTCGGTT CCGGCCGCAT TCCGCCGAAC CCCAGCGAAC TGCTGGGCTC CAAGGCCATG AAGAACATGC TCAACGCCCT GGCAGAGAAC GCAATCGTGC TGATCGACGC CCCGCCGCTG CTGCCGGTCA CGGATGCTGC GGTACTCTCC CGCGTGGCGG ACGGCGCCAT CGTGGTGATC CGGACGGGCC GGACCACCCA GGAGCAACTG GGCCAGTCCC TGGGCAACCT GGAAAAGGTG AAGGGCCGCA TCCTGGGCGC CGTCCTGAAC TACGTGCCCA CCAAGGGCAC GGACGCCTAC TCCTACTACG GGACGTACAC CTCGGCTCCT GAAACGCAGG ACCTCCCGGA GCTCGCCCAT CCCGACGCCG TGGCGCACGA ACCGCAGTGG GACACCGAAC ACGACGACGT CCTGGAACCC GCCGCGGCCG GCCGCCGCGC ACGGGCCTAG
|
Protein sequence | MSVNIAAGAE APAGLDLADY LRVVRVYWKA IVAFTLLATL TAFGWTVLQP KIYSSDSSGI VVTPGSDNVS LSLAGDSLAK AKVKNYESVA KSRLVADRVI ASLELKTTAD ALLGTISVKV PLDTAEIRVT AQSPDPATAQ RVADAWVNGL AAQVEAIETA TPGTATPDAG TSPDAATAPA ATASAVRILP LGKAVLPTSP VSPNVKLTLA LGALIGLALG VAYALVRRHL DRRIRNATEI ERLFDVPVIG TLPVDHRLDE KSTILDAGAA AQLHDAGGAM AEALRELRTN LSFLDVDQPP RIIVVTSSMQ AEGKSTVTAN LAVTMAAAGE NVVVVDGDLR RPTLVDVFNL VPGVGVTDVL TGTAELEDVL QPWGALPNLS VLGSGRIPPN PSELLGSKAM KNMLNALAEN AIVLIDAPPL LPVTDAAVLS RVADGAIVVI RTGRTTQEQL GQSLGNLEKV KGRILGAVLN YVPTKGTDAY SYYGTYTSAP ETQDLPELAH PDAVAHEPQW DTEHDDVLEP AAAGRRARA
|
| |