Gene Achl_3443 details

Gene Information

Locus tag: Achl_3443
Symbol:
ID: 7294924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3816040
End bp: 3817656
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 69%
IMG OID: 643591850
Product: Spore coat protein CotH
Protein accession: YP_002489489
Protein GI: 220914180
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5337] Spore coat assembly protein
TIGRFAM ID:
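
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3816040
end_bp = 3817656

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1617 538
```

Under these assumptions the result matches the listed 1617 bp and 538 aa.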


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 92
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACGTT CCCCCGGTGC CATTCTTTCC GCATCCGCCC TGGCGGCAGC CCTTGCCCTG 
ACCGGGTGCG GTGCTGGGGC AGGTGCAGAG CTGTCCGGCA CGCCGGCCAC GTCCGCAGCG
TCGTCCGACG CGGGAGTTGA CGCCGGCACC AGCACCACGG AGACCACGAC GGCGGACTCC
ACCCTGTTCA CGGACGGCGC CTCCCACACG GTCAGCGTCA CGTGGAACGA GGACGACTAC
GCCGCAATGA TCGCCGCCTA CGAGGCCGAC GGGTCCAAGG ACTGGATCGC CGCGGACATC
ACCATCGACG GCACCACCGT GTCCAACGTG GGCGTGCGCC TCAAGGGCAA TTCCACGCTT
CGGAGCCTGA GCGGCAGCGG GAACGGCGCG GGCGGCGGGA TGGGCGGGAA CGCGGCGTCG
TCCGGGATCT CCTCCGACGT CCCTGAGTCC CTGCCCCTGC TGATCCGCTT CGACAAGTAC
GTGGACGGCC AGACCTACCA GGGGCTGGGC GAGGTGTCGC TGCGCCCTGG CTCTCCCGTC
CTGAATGAGG CGCTGGCCCT GGCCCTCACC GGGGCCAGCG GCCAGGCCAC GCAGCGGTAC
GCCTACACCA CCTACTCAGT GAATGGCAGC CCCACCCAGA CCAGGCTGCT CGTGGAAAAC
CCCGATGAGG ACTATGCGGA TTCCCTGTTC GATACCCCGG GAGTCCTCTA CAAAGCCGAT
GCCGATTCCA GTTTCACTTA CCAGGGCGAC GACCTGGCCA CCTACGAGGA CCAGTTCAAG
CAGCTCAACA ACGGGGAGAG TGAGACTGTC CAGCCCATCG TCGACTTCCT CAAGTGGCTG
TCCGAAGCCA CCGACGAGGA ATTCGACGCC GGCCTGGCGG AGCGCGTGGA TGTGGAGTCC
TTCGCCCGCT ACACCGCCAC GATGAACCTG CTGGTCAACG GCGATGACAT GGCCGGCCCC
GGCCAGAACT ACTACCTCTG GTACAGCCTG GACACCAGGA AGATCTCCGT CATCTCCTGG
GACCTCAACC TCGCCATGAC CGGCGACGCC ACGGCATCGC CCGAGGCGCA ACTGTCCATC
GGAGGCGGCG GAGGCGGCGC GGGCGGCGGA GGCGGCGGAG GCGGCGCGGG CGGCGGAGGC
GGCGGAGGCG GCGCGGGCGG CGGCATGCAG CCTCCGGGAT CGGACGACGG CGGCCGGGCA
CCTTTTGCCG GCGCCGCGGC GGATGCTGGC GGCGCAGCGG ATGCTGACGG CACGGCAGCA
GCGGATGGGG CAGGGCCGGG CGAAGCGGCC CCGGGCGGAG CAGGGCCGGG TGAAGCGGCA
ACAGGTACCT GGGATGCCGC AGCAGGTACC GGCGCAGGCG GTGGAGGCGG CCGCGGCGGC
AACGAACTGA AGGAAAGGTT CCTGGCTTCG GATGCCTTCC AGTCCGTGTA TGACGCCGCC
TACGCGGATC TTTACGCCCA GCTGTACGCC AGCGGAACCG CCGCCAGCCT CCTGGACTCA
ATTGCCGCCG TCGTACCCCT CAGCGACGGC CTCACCGCCG AGGAACTGGC TGGTGAAACG
CAGACCCTGC GCACTTTCAT CCAGGAACGC ACGGATGCGC TGAAGGGCCA GGTCTAG
 
Protein sequence
MRRSPGAILS ASALAAALAL TGCGAGAGAE LSGTPATSAA SSDAGVDAGT STTETTTADS 
TLFTDGASHT VSVTWNEDDY AAMIAAYEAD GSKDWIAADI TIDGTTVSNV GVRLKGNSTL
RSLSGSGNGA GGGMGGNAAS SGISSDVPES LPLLIRFDKY VDGQTYQGLG EVSLRPGSPV
LNEALALALT GASGQATQRY AYTTYSVNGS PTQTRLLVEN PDEDYADSLF DTPGVLYKAD
ADSSFTYQGD DLATYEDQFK QLNNGESETV QPIVDFLKWL SEATDEEFDA GLAERVDVES
FARYTATMNL LVNGDDMAGP GQNYYLWYSL DTRKISVISW DLNLAMTGDA TASPEAQLSI
GGGGGGAGGG GGGGGAGGGG GGGGAGGGMQ PPGSDDGGRA PFAGAAADAG GAADADGTAA
ADGAGPGEAA PGGAGPGEAA TGTWDAAAGT GAGGGGGRGG NELKERFLAS DAFQSVYDAA
YADLYAQLYA SGTAASLLDS IAAVVPLSDG LTAEELAGET QTLRTFIQER TDALKGQV
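
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch like the one below. It assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons, and it does not handle alternative start codons (this gene starts with ATG, so none is needed here). The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGCGACGTT CCCCCGGTGC CATTCTTTCC"
print(translate(first_block))  # prints: MRRSPGAILS
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` should allow the listed GC content and protein sequence to be checked in the same way.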