Gene Anae109_2263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2263 
Symbol 
ID5375620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2569724 
End bp2570734 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content74% 
IMG OID640843781 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001379449 
Protein GI153005124 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.164031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0202941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAGCGG GGGCTTGCAC CCACACGCGC TCGAAGGGCG GCCGCGCCCG CCGCCCAGGG 
AATTCGGGGG TCGTCACCCC CGTTGGACCG ATCGTGCTCA AGACCCGACT CCTGTCGAGC
CTCGGCGGGG CCGCCGCCGT CGCCGGCCTC TTCTTCGCCG CCTCTCCGCT GCTCGGGCTG
TCGCCCGCCG AGCGCGCCAC CAATCCGCGC GCGGACACGG TCCACAAGGT GTTCCCGAGC
GCGGTGCGCA TCCAGATCTC CGCGGCGGGC GAGGTGGTCC GCAGCGCCTC CGGCATCGCC
TTCGCGCGGA GCGGCGAGCG GACGTACGTC CTGACGAACG CCCACGTCGT CGCGAACAAG
CGGACCTGGA AGGACCCGGT GCGCGTCGAG GTGCTGCCCA GCGAGGGCCA GGGCCGGGTC
CTCGCGAAGG TGGTCGCGAC GGGTGCCCTG CCGGACACCG ACATCGCCGT GCTGGAGGTG
CAGGGCTCGC TCCCCGTGAC GCCGCTCGGG CCCGACGACG AGCTGGAGCT CGGCGACGAC
CTCGTCGTGA TCGGCGCGCC GTTCGGGAAG GGGCTCTCCG TCGCGGCGGG GATCGTCTCG
CAGGTGGAGT ACGAGTTCCT CGAGAACGCA GCGGCGCCGC GGCGCGCCAA GTCGCTCAAG
ACGGACGCCG CCATCGGCTA CGGCAGCTCC GGGGGCGGCG TGTTCGACGT GCCGCGCGGC
CGGCTCATCG GCCTGGTCGA GGGCTATCGC ACCGCGCGCG TCGAGTTCGG CAAGGACGCG
AACCAGTACG CGTTCGACGT GCCCATGCCG GGCGAGACGT TCGTCGCCCC GGCCGCGAAG
ATCCGGCGCT TCCTCGCGGA CAAGGGGTTC GCGCATCTCG CCGATGGCCG TCCGGACGAG
GTCGCCGCGC GCGAGAAGGA CCTTTCGCAG GGAGAGCTCG CCTCGAAGCA GGGCGAGGTG
GCCGCGCGCG CTCAACCGGT CGCGCTGCCG GCCGCGCCCG CGGGCATGTA G
 
Protein sequence
MRAGACTHTR SKGGRARRPG NSGVVTPVGP IVLKTRLLSS LGGAAAVAGL FFAASPLLGL 
SPAERATNPR ADTVHKVFPS AVRIQISAAG EVVRSASGIA FARSGERTYV LTNAHVVANK
RTWKDPVRVE VLPSEGQGRV LAKVVATGAL PDTDIAVLEV QGSLPVTPLG PDDELELGDD
LVVIGAPFGK GLSVAAGIVS QVEYEFLENA AAPRRAKSLK TDAAIGYGSS GGGVFDVPRG
RLIGLVEGYR TARVEFGKDA NQYAFDVPMP GETFVAPAAK IRRFLADKGF AHLADGRPDE
VAAREKDLSQ GELASKQGEV AARAQPVALP AAPAGM