Gene AnaeK_3397 details


Gene Information

Locus tag: AnaeK_3397
Symbol:
ID: 6783894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. K
Kingdom: Bacteria
Replicon accession: NC_011145
Strand:
Start bp: 3841865
End bp: 3843538
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 73%
IMG OID: 642764862
Product: Spore coat protein CotH
Protein accession: YP_002135739
Protein GI: 197123788
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5337] Spore coat assembly protein
TIGRFAM ID:
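
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions, not something the record states. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3841865
END_BP = 3843538


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1674
print(protein_length(length_bp))  # 557
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.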


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACGCA CCTCGGCGGC CGCCGTTCTC CTGCTTCCGT TCCTCCTGCT CGCCTGCAAC 
GGCGCGCCCC TCCCCGACGC CTCGCTGGGC GGCCAGCCGG GCGGTGGTGG ACCGGGCGGC
CAGCCGGGCG GAGGCGGTGG CGGCGGCGGC GCCGGGGCGG TGAACCCCGA TGCCCCGCTC
GCCGCGCCCG CCTGGCCTCC GCTGCAGACC AGCGTCGAGC CGTACGCGCT CGCGTTCGAC
GACGCGACGG CGGCCATCGT CTACACGATG TGGAGCAAGG AGTACGCGCC GGGCCGGTTC
CACTACCACG GCGCCTGGTG GGACGTGGAG CTGCGCCAGC GCGGCGACGG CTCGCGCGAG
CACCCCAAGC ACAGCTGGAA GGTGCGCCGC CCGAAGGACG CGCCGCTCGA CGGCGAGCGC
ACGCGCAACC TGCTCGCGGA GTGGCCCGAC GGCGGGTACC TGTCGGACCC GTTCTCCTAC
GGGCTCATGA AGGGCGCCGG CGTCCCCACG CCGCGCGCGC AGTTCGTGAC GCTCGACGTG
AACGGCGAGC ACCAGGGCGT GTACGTCGAG CTCGAGGAGC CGGACGAGAA GCACTTCCTC
CTGGAGAACG GCATCGACTC GAACGCCAAC GTCTACCGCT GCGGGCTGCG CGACTGCGAG
CTGAAGCTCA CCCCGCCCGC CCACTACCAG GGCCCGTGGG AGAAGAAGAC GAACGAGAGC
GAGCCCTCCG ACGACCTCGA CGCGTTCCTG ATCGGCCTGA ACCGCACGCC CGAGGGAGAG
ATCGAGGCGT GGCTGGAGCA GCACGTCGAC CTGCCGCGCT TCTTCCGCTT CTACGCGGTG
GGCATCCTCA TCAGCCTGTC CGGCATCGAC GACTCCGGCA GCTACCTCGT CCACGACCGC
ACGCGCGACA AGTGGCTGTG GGTGCCGTGG GATCTCAACA ACGCGAAGCT GGTGTTCTGG
CGCGACAACC CGGTGGAGTG GGGCGTGCCG TTCCGCTACG CGATCCCGTT CTACACGCTG
TACGACGCGG GCACCCTCGG CGTCGCGGCC GGCAAGGAGG CGCGCTACGG CGGCGCGCAC
CCGCCGTTCG TGGTGCTGTT CCAGCGCATC TGGGACCGGC CGGCGCTGCG GAACAGGATC
CTCGACGAGG TGGAGGCGAT GCTCGACGGC CCGTTCGCCG AGGCCGAGAC GTCGCCGCGC
ATCGACGGGC TGCGCGCGCT CATCGCCGGG CTGCTCCCGG CCGATCCCTG GGTGGATCCG
GCGCACGCGG ACGCCTCGGT GCAGGTGCTG AAGGACTACG TCCGTCGCCG GACCGGGTTC
CTGCGCGAGC AGATCGCGCT CGAGCGCCAC CGCGGCGAGG GAGGCCTGGT CGTGAACGCG
ATCGCGCCGG ACGCGATCGA GCTCTACAAC CGCGAGGACG CGCCGCGCGA CCTGGGCGGC
CTGGCCCTGA CCGGCGACCT GCGCCAGCGG CTCGCGACGC TGCTGCCCGC CGGCACGGTG
GTGCCGCCGC ACGGGACGCT GCGGCTCCCG TTCCCGGTGG CGGCCGAGGG CGGCGAGGTC
GGGATCTTCG ACGTCGCCAG CCAGCTGCCG GTGGACGCCG TCTACTACGG GCCGCCCGGC
GGGCGGACCT ACGCGCGCAC GCCGGACGGC GCGGAGACGT GGGCCTGGCG ATGA
 
Protein sequence
MRRTSAAAVL LLPFLLLACN GAPLPDASLG GQPGGGGPGG QPGGGGGGGG AGAVNPDAPL 
AAPAWPPLQT SVEPYALAFD DATAAIVYTM WSKEYAPGRF HYHGAWWDVE LRQRGDGSRE
HPKHSWKVRR PKDAPLDGER TRNLLAEWPD GGYLSDPFSY GLMKGAGVPT PRAQFVTLDV
NGEHQGVYVE LEEPDEKHFL LENGIDSNAN VYRCGLRDCE LKLTPPAHYQ GPWEKKTNES
EPSDDLDAFL IGLNRTPEGE IEAWLEQHVD LPRFFRFYAV GILISLSGID DSGSYLVHDR
TRDKWLWVPW DLNNAKLVFW RDNPVEWGVP FRYAIPFYTL YDAGTLGVAA GKEARYGGAH
PPFVVLFQRI WDRPALRNRI LDEVEAMLDG PFAEAETSPR IDGLRALIAG LLPADPWVDP
AHADASVQVL KDYVRRRTGF LREQIALERH RGEGGLVVNA IAPDAIELYN REDAPRDLGG
LALTGDLRQR LATLLPAGTV VPPHGTLRLP FPVAAEGGEV GIFDVASQLP VDAVYYGPPG
GRTYARTPDG AETWAWR
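
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, the following sketch translates a nucleotide string and computes its G+C fraction. It assumes that internal codons of translation table 11 follow the standard codon assignments (table 11 differs from the standard code in its permitted start codons) and that the sequence begins with ATG, as this one does. The helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGCGACGCA CCTCGGCGGC CGCCGTTCTC"
print(translate(first_bases))  # MRRTSAAAVL
```

The translated fragment matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.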