Gene Anae109_0099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0099 
Symbol 
ID5376478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp121028 
End bp122860 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content75% 
IMG OID640841613 
Productspore coat protein CotH 
Protein accessionYP_001377303 
Protein GI153002978 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.158821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCCTCC CCGCCGGCGG CCTCCCGGTT CGTCCCCGGC GGATCGCACG TTGCCCCGTG 
AACAGGCAAG CCCCCTTCCT CGTGGCCGCC ATCCTCTCGC TCGCCTGCGG CCGCGGCGAC
GGGACGACCG TCGTGCGCCC TCCCGACGAG CAGCCGCCGC CGTACCAGCC CCCGCAGCAG
CAGCTCCCTC CCCCGGGGCC GTCCTGGCCG GTGGCGAGCG CCCCCGGCGC CTTCCCCACC
CCCGAGCCGC GCCTGCGCCG CGTGGATCTC ACGCTCGCGC CGGGCGACCT CGCCGCGCTC
GAGGCGGAGC CGACCTGGGA CGTCATGTAC CCCGCGACCG TGGCGCTCGA CGGCCGCGCG
GCCGAGGGGC TCGTGCGCTT CCGCGGCGCC TCGTCCCGCA CGCGCCCGCA GCGGAGCTGG
CGCATCGACC TCGACCCGGG CTACGCGCTC GAGGGGCGCG ATCGCTTCGC GCTGCTCGCC
GAGTACGACG ACGCGGCGAA GCTCGTGGAG CGCTTCGCGG TGGACCTCTA CCGCGCCCTC
GGCCTCCCCG TCCCCTTCGC CCGCTACGTG AAGCTCTACG TGAACGGCGC CTACCGCGGC
GTGTTCCTCG ACATGGAGCG GGTGGGCGGC GAGTACCTCG TCCACCACGG GCACGAGACC
GACGCCTCCC TCTACCGCTG CGGCGGCCGC AACTGCGAGC TGAAGCTGCT CCCCCGCTAC
GAGCCGGCGT ACCAGCAGGA CTTCACGAAG ACGCGCAACG AGGCGCTCCC CTGGGACGAC
CTCGACCGCC TCCTCGAGAC CGTCAGCCGG ACGGACGACG CGGAGCTCGA GCGGCGGCTC
GGGGAGGTGA TGGACCTCGA CGCCTACCTC GGCAACCTCG CGGCGGACGA GCTCATCGCG
AACACCGTCC TCGAGGACGC CCGCGGCTAC TGGCTCCACG AGCTGCACCG CGACGTGTGG
ACCTACGTCC CGTGGGATCT CAACAACTCG CAGGCCTACT GGTCGCGCGA CATCGCCGCC
AACGCGCCCC TCGGCGGCGT CTGGCGCGAG GCGCAGATCT TCTCCATCTA CGACGACGCG
GTGCGGCGGA TCTACGACGT GCGGGTGACG GAGCGCGCCG GGCAGAAGCC GACCTGGAGC
GTGCTCGCCA CGCGCGTCTG GGATCACCCG GCGCTGCGCG CGCGCCTGCT CGCCAAGCTC
GAGGCCGCCC TCGCGGGCCC CTTCACCCCG GAGCGGGTGG GCCCCTACCT CGAGGCGCTC
TGGGCGGCGG CGGGCCCGGA GATCGTGACC GATCCGTTCG TGGACGCGAC GCGCGCGCAG
ACGGTGCCCT CCCGCCTGGT GACGTTCGTC CGCGATCGCC GGAGCGTCCT CCTCGGGCGG
CTGGCGAGGC TGCGCGCGCA CGGCTCGGGT CCGCTCGTGG TGAACGAGGT CGGCATGCCC
GGCGGGACGG GGCCCGGCTA CGTGGAGCTG TACAACCGGG CCGACGTCGC GCTCGACCTC
TCGGGCGCCC ACGTCACCGA CGACCTGCGC GCGCCGGGGA AGTACGCGCT CCCCGAGGGG
ACCACGATCC CGCCGCGCGG GCACCTGCTG CTCGTCGCGG ACGGCACGCC CTGGCCGCCG
CCGGCCGGGG CGCCGGCGGG CGAGCCGCTG CGCCTGCCGT TCGTCCTCTC CCCCGCGGGC
GGCGAGCTCG GCGTGTTCGC GCCGGGGACG CTGCACGCTC CGTACGATCT CGTCTACTTC
GGCCCGCGCG CGGAGGGGGG CGCCTACGGG CGCACCGGCG ACGGCGTCGA GGCGTGGTCG
GACGTGGCGC GCACGCCGGG GGCGCCGAAT TGA
 
Protein sequence
MVLPAGGLPV RPRRIARCPV NRQAPFLVAA ILSLACGRGD GTTVVRPPDE QPPPYQPPQQ 
QLPPPGPSWP VASAPGAFPT PEPRLRRVDL TLAPGDLAAL EAEPTWDVMY PATVALDGRA
AEGLVRFRGA SSRTRPQRSW RIDLDPGYAL EGRDRFALLA EYDDAAKLVE RFAVDLYRAL
GLPVPFARYV KLYVNGAYRG VFLDMERVGG EYLVHHGHET DASLYRCGGR NCELKLLPRY
EPAYQQDFTK TRNEALPWDD LDRLLETVSR TDDAELERRL GEVMDLDAYL GNLAADELIA
NTVLEDARGY WLHELHRDVW TYVPWDLNNS QAYWSRDIAA NAPLGGVWRE AQIFSIYDDA
VRRIYDVRVT ERAGQKPTWS VLATRVWDHP ALRARLLAKL EAALAGPFTP ERVGPYLEAL
WAAAGPEIVT DPFVDATRAQ TVPSRLVTFV RDRRSVLLGR LARLRAHGSG PLVVNEVGMP
GGTGPGYVEL YNRADVALDL SGAHVTDDLR APGKYALPEG TTIPPRGHLL LVADGTPWPP
PAGAPAGEPL RLPFVLSPAG GELGVFAPGT LHAPYDLVYF GPRAEGGAYG RTGDGVEAWS
DVARTPGAPN