Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3443 |
Symbol | |
ID | 7294924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3816040 |
End bp | 3817656 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643591850 |
Product | Spore coat protein CotH |
Protein accession | YP_002489489 |
Protein GI | 220914180 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5337] Spore coat assembly protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGTT CCCCCGGTGC CATTCTTTCC GCATCCGCCC TGGCGGCAGC CCTTGCCCTG ACCGGGTGCG GTGCTGGGGC AGGTGCAGAG CTGTCCGGCA CGCCGGCCAC GTCCGCAGCG TCGTCCGACG CGGGAGTTGA CGCCGGCACC AGCACCACGG AGACCACGAC GGCGGACTCC ACCCTGTTCA CGGACGGCGC CTCCCACACG GTCAGCGTCA CGTGGAACGA GGACGACTAC GCCGCAATGA TCGCCGCCTA CGAGGCCGAC GGGTCCAAGG ACTGGATCGC CGCGGACATC ACCATCGACG GCACCACCGT GTCCAACGTG GGCGTGCGCC TCAAGGGCAA TTCCACGCTT CGGAGCCTGA GCGGCAGCGG GAACGGCGCG GGCGGCGGGA TGGGCGGGAA CGCGGCGTCG TCCGGGATCT CCTCCGACGT CCCTGAGTCC CTGCCCCTGC TGATCCGCTT CGACAAGTAC GTGGACGGCC AGACCTACCA GGGGCTGGGC GAGGTGTCGC TGCGCCCTGG CTCTCCCGTC CTGAATGAGG CGCTGGCCCT GGCCCTCACC GGGGCCAGCG GCCAGGCCAC GCAGCGGTAC GCCTACACCA CCTACTCAGT GAATGGCAGC CCCACCCAGA CCAGGCTGCT CGTGGAAAAC CCCGATGAGG ACTATGCGGA TTCCCTGTTC GATACCCCGG GAGTCCTCTA CAAAGCCGAT GCCGATTCCA GTTTCACTTA CCAGGGCGAC GACCTGGCCA CCTACGAGGA CCAGTTCAAG CAGCTCAACA ACGGGGAGAG TGAGACTGTC CAGCCCATCG TCGACTTCCT CAAGTGGCTG TCCGAAGCCA CCGACGAGGA ATTCGACGCC GGCCTGGCGG AGCGCGTGGA TGTGGAGTCC TTCGCCCGCT ACACCGCCAC GATGAACCTG CTGGTCAACG GCGATGACAT GGCCGGCCCC GGCCAGAACT ACTACCTCTG GTACAGCCTG GACACCAGGA AGATCTCCGT CATCTCCTGG GACCTCAACC TCGCCATGAC CGGCGACGCC ACGGCATCGC CCGAGGCGCA ACTGTCCATC GGAGGCGGCG GAGGCGGCGC GGGCGGCGGA GGCGGCGGAG GCGGCGCGGG CGGCGGAGGC GGCGGAGGCG GCGCGGGCGG CGGCATGCAG CCTCCGGGAT CGGACGACGG CGGCCGGGCA CCTTTTGCCG GCGCCGCGGC GGATGCTGGC GGCGCAGCGG ATGCTGACGG CACGGCAGCA GCGGATGGGG CAGGGCCGGG CGAAGCGGCC CCGGGCGGAG CAGGGCCGGG TGAAGCGGCA ACAGGTACCT GGGATGCCGC AGCAGGTACC GGCGCAGGCG GTGGAGGCGG CCGCGGCGGC AACGAACTGA AGGAAAGGTT CCTGGCTTCG GATGCCTTCC AGTCCGTGTA TGACGCCGCC TACGCGGATC TTTACGCCCA GCTGTACGCC AGCGGAACCG CCGCCAGCCT CCTGGACTCA ATTGCCGCCG TCGTACCCCT CAGCGACGGC CTCACCGCCG AGGAACTGGC TGGTGAAACG CAGACCCTGC GCACTTTCAT CCAGGAACGC ACGGATGCGC TGAAGGGCCA GGTCTAG
|
Protein sequence | MRRSPGAILS ASALAAALAL TGCGAGAGAE LSGTPATSAA SSDAGVDAGT STTETTTADS TLFTDGASHT VSVTWNEDDY AAMIAAYEAD GSKDWIAADI TIDGTTVSNV GVRLKGNSTL RSLSGSGNGA GGGMGGNAAS SGISSDVPES LPLLIRFDKY VDGQTYQGLG EVSLRPGSPV LNEALALALT GASGQATQRY AYTTYSVNGS PTQTRLLVEN PDEDYADSLF DTPGVLYKAD ADSSFTYQGD DLATYEDQFK QLNNGESETV QPIVDFLKWL SEATDEEFDA GLAERVDVES FARYTATMNL LVNGDDMAGP GQNYYLWYSL DTRKISVISW DLNLAMTGDA TASPEAQLSI GGGGGGAGGG GGGGGAGGGG GGGGAGGGMQ PPGSDDGGRA PFAGAAADAG GAADADGTAA ADGAGPGEAA PGGAGPGEAA TGTWDAAAGT GAGGGGGRGG NELKERFLAS DAFQSVYDAA YADLYAQLYA SGTAASLLDS IAAVVPLSDG LTAEELAGET QTLRTFIQER TDALKGQV
|
| |