Gene Anae109_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3035 
Symbol 
ID5375391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp3534910 
End bp3538071 
Gene Length3162 bp 
Protein Length1053 aa 
Translation table11 
GC content72% 
IMG OID640844560 
Productprotease domain-containing protein 
Protein accessionYP_001380216 
Protein GI153005891 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.559762 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGTTC GCTCGTGTCT CGCCCTTGCC GTAAGCGCTG CGCTCGCCGC CTCGCCGGCG 
CTCGCCGATC CCCGCGCCCG CGATGTCGCG CTGAAGAGGC TACAGGATGC GGCAAAAGGC
ACCGCCGCCG TGTCGATGCA CAAGTCGACC GACGTGGCGC GGTTCGTGAG CCTGCCGCCG
GCTGCACGCG GGCTCGGCCA GGCCGCGGCA AGGACGGAGC GCGACAAGAA GGCGCAGTCC
GCCGCGTTCT TCCGCAGCTA CGGCGCGGCG ATCGGGGTCT CGGATCCCGC CGGTCTCCGC
CACGTCTCGA CCGTCGTCGA CGCGCTCGGC GAGACCCACC TCACGTACCG GCAGTTCCAC
GACGGCGTCC CGGTCTTCGC GGGAACGGTG AAGACGCACT TCGACGCGGC GCACAAGCTG
AAGGCGGTGA ACGGCACCGC GGTCCCGGAC CTCGCGGTCG TCCCGACGCC GACGTGGAGC
GCCGCGCAGG CGGCAGAGCT CGCGCTCTCC GCGGTCGTCC GTGAGCGGGG CCCCTCGGGC
ACCCTCGGGA TCGGCTCGAC GAAGCTGTAC GTCTATCGCG AGGGGCTGGC GAAGGGCGTC
CCGGGCGAAG CGCAGCTCGC GTGGGAGATC GAGGTGACCG ACGGCGCCGG CGTCCGCGAC
CTCGTCTACG TCGGCGCGCA CACGGGCAAG GTCGTCGAGA CCGTCGCCGG GATCCACGAC
GAGCTGAACC GCCGGGCGTA CGACGGGCAC GAGCTCGCGT TCGCGCCGAG GAGCTACCCG
AACGCCGCCT ACTGGACGGA GGGGCAGGGC TTCCCGACCG CGTCCGAGGA GGCGAACAAC
ATGATCGCCT CGTCGAAGGA GGTCTACGAC TTCTTCAAGA ACGCGTTCGG CCGCGACTCC
TTCGACGGCC AGGGCGCCAC GATGGATGCC ATCTTCGACC GCGGCTACAG CTGCCCGAAC
GCTTCGTGGA ACGGGACGTT CATCTCGTTC TGCCCGGGGA CGACCACCGA CGACGTGACC
GCGCACGAGT GGGGGCACGC GTACACGCAG TACACCCACG ACCTCATCTA CGCGTGGCAG
CCGGGCGCGC TCAACGAGGC GTACTCGGAC ATCTGGGGTG AGATCGTGGA CGCCATCAAC
GGCCGGGGCG GCGACGCGCC GGACGCGGCG CGCAGCGCGG GCGCCTGCTC GACCTTCTCG
CCGCCGGTCG CCAAGCTCGT CGTGACGGCG CCGGCCTCCC TCGCGGGCGA GTACTTCGCG
CAGTCGGCGT CGTTCGGCCC GAGGCTGACC GCGGCGGGGA TCACGGGCGA GGTCGTCGCC
GCGCTCGATC CCGCCGACGC CGGAGGCCCC AGCACCCTCG ATGCGTGCTC GCCTCTCACC
AACGCCGCGG CGGTCCTGGG CAAGATCGCC CTCGTGAACC GCGGCTCGTG CAACTTCACC
GTCAAGGTGA AGAACGCGCA GACCGCCGGG GCGGTCGCGG TGATCGTCGC GAACAACGCG
GCGAATGGCC TCCCCGGGAT GGGCGGGTCG GACGCGTCGG TGACCATCCC GTCCGTCGGC
GTGCAGAAGG CGACGGGCGA CTCCATCCGC GCCGCGCTGG CCGGCGCCGA GGTGGTGACC
GCGAAGCTGG TGGCCCAGCC CGGCAGCGAC GCGTCGGTGC GCTGGCTCAT GGGCGAGGAC
TCGGCGGCGT TCAGCGGCGC CCTCCGGGAC ATGTGGAACC CGACCTGCTA CTCGAACCCG
GGCAAGGTGA CGGACCGGGC GTACTACGTG TGCGACTACG CCGGCGACAA CGGCGGCGTG
CACACCAACT CGGGCGTGCC GAACCACGCG TTCGCCCTGC TGGTCGACGG CGGCTCGTAC
AACGGGCAGA CGATCGCCGG GATCGGGCTC ACCAAGGCCG CGCACCTCTA CTTCCGCGCC
GCGGACGTCT ACCAGGTCGA GGACAGCGAC TTCGCCGACC ACGCGACCGC GCTCGAGTCC
TCCTGCGCCG ACCTCACGGG GGCCACGCTC CCGGCGCTCA CGGGCGGCGC CTCGGGCGAG
ACCATCACCG CCGCCGATTG CGCGGAGGTG GCGAGGACGA TCGCAGCGGT CGAGCTGCGG
ACGCCCCCGA CCTTCTGCGG GTTCAGGCCG CTCCTCAACC CGCAGGAGAC GCCGCAGTGC
GGGGCGGGCA CGAACGGAGC GAAGCAGGCG ATTCAGCTGT TCTCGTTCGA CGCGGGGGCC
TCCGGCTGGA CCTCCGCGAC CTTCGCGGCG GCGCCCGGCG ACTTCACCCC GCGCGAGTGG
ACCTGGCTCG GCGCGCTCCC GAACGGCCGG CCGGGCTCCG CCTTCTTCGC CCCGGATCCC
AACATCGGCA CGTGCGCACC GGGCGGGGAC GAGACCGGGG TGCTCTCGCT GACGAGCCCC
GCCATCACGC TCCCGGGCAC CACCGAGTTC GCCCGCGCCA CGTTCTGGCA CTGGGTCGCC
ACGGAGGCGG GGTTCGACGG CGGGAACGTC AAGGTGAGCG TCAACGGTGG GCCGTGGCAG
CTCCTCCCGC CGTCGGCGTT CACGTACAAC GGGTACAACG CGCTCCTCGA GACGGCCGCC
GCCGGCAACA CGAACCCGCT CGCCGGTCAG CCGGCCTGGA CCGGCACGGA CGCCGGTACC
CTCAAGAGCG GGACCTGGGG ACGGTCGCAC GTGGATCTCG GCACGTTCGC GAAGGCCGGG
GACCGGATCC AGCTGCGCTT CGACCTGGGC ACCGACGGCT GCGGTGGCCG CGGAGGGTGG
TACGTCGACG ACGTCGAGGT GTTCAGCTGC ACGCCGAACG CGGCGACGGT GGCGGTCGAG
GACGCGGCGT ATCCAGAAGG CGACGCGGGC GTCGCGACGA GGGGCGTCAC CGTCCGGCTC
TCCGCGGCGA CGGTCCAGCC GGTGGCGGTC CGCTACGTGG TCGTCGACGG AACCGCCCAG
CACGGGAACG ACTTCGAGGT CGCTGCCACC TCCGGCACGA TCGTCGTCCC GGCGGGCAGG
ACGGCCGCGA GCATCGCCAT CGGCCTCAAG GGCGACATCG TGCCCGAGGG CGACGAGGCG
TTCCTGGTGA GGATCACCGG CGTCACCGGC GCCACGCTCG CCGACGGCGA GGCGGTGGTG
ACGATCCTCG AGGATGACGG AGCGCCTCCG GGGCAGAACT GA
 
Protein sequence
MQVRSCLALA VSAALAASPA LADPRARDVA LKRLQDAAKG TAAVSMHKST DVARFVSLPP 
AARGLGQAAA RTERDKKAQS AAFFRSYGAA IGVSDPAGLR HVSTVVDALG ETHLTYRQFH
DGVPVFAGTV KTHFDAAHKL KAVNGTAVPD LAVVPTPTWS AAQAAELALS AVVRERGPSG
TLGIGSTKLY VYREGLAKGV PGEAQLAWEI EVTDGAGVRD LVYVGAHTGK VVETVAGIHD
ELNRRAYDGH ELAFAPRSYP NAAYWTEGQG FPTASEEANN MIASSKEVYD FFKNAFGRDS
FDGQGATMDA IFDRGYSCPN ASWNGTFISF CPGTTTDDVT AHEWGHAYTQ YTHDLIYAWQ
PGALNEAYSD IWGEIVDAIN GRGGDAPDAA RSAGACSTFS PPVAKLVVTA PASLAGEYFA
QSASFGPRLT AAGITGEVVA ALDPADAGGP STLDACSPLT NAAAVLGKIA LVNRGSCNFT
VKVKNAQTAG AVAVIVANNA ANGLPGMGGS DASVTIPSVG VQKATGDSIR AALAGAEVVT
AKLVAQPGSD ASVRWLMGED SAAFSGALRD MWNPTCYSNP GKVTDRAYYV CDYAGDNGGV
HTNSGVPNHA FALLVDGGSY NGQTIAGIGL TKAAHLYFRA ADVYQVEDSD FADHATALES
SCADLTGATL PALTGGASGE TITAADCAEV ARTIAAVELR TPPTFCGFRP LLNPQETPQC
GAGTNGAKQA IQLFSFDAGA SGWTSATFAA APGDFTPREW TWLGALPNGR PGSAFFAPDP
NIGTCAPGGD ETGVLSLTSP AITLPGTTEF ARATFWHWVA TEAGFDGGNV KVSVNGGPWQ
LLPPSAFTYN GYNALLETAA AGNTNPLAGQ PAWTGTDAGT LKSGTWGRSH VDLGTFAKAG
DRIQLRFDLG TDGCGGRGGW YVDDVEVFSC TPNAATVAVE DAAYPEGDAG VATRGVTVRL
SAATVQPVAV RYVVVDGTAQ HGNDFEVAAT SGTIVVPAGR TAASIAIGLK GDIVPEGDEA
FLVRITGVTG ATLADGEAVV TILEDDGAPP GQN