Gene Anae109_2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2051 
Symbol 
ID5374519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2321335 
End bp2324565 
Gene Length3231 bp 
Protein Length1076 aa 
Translation table11 
GC content75% 
IMG OID640843563 
Producthypothetical protein 
Protein accessionYP_001379238 
Protein GI153004913 
COG category 
COG ID 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain
[TIGR03382] Myxococcales GC_trans_RRR domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.890498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.473227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTCAT CACTCCGGGT GGTCACCCGT CGCGTCGCGG CGCTCGCCTC CGCGCTCCTG 
TCCGCCGCAG CGGCGGCCGC CCCCGCGCCG CCCGGCGCCT TCTCCGTGCG GCAGCCGGAC
GGCTCGCCGC TCGCCGTGCG GGCGTACGGG GACGAGCACG GGATCGTCTT CGAGACGCTC
GACGGCTACG CGCTCGCGCA GACCGCCACC GGCGAGTGGC GCCTCGCGCG CCTCGCGCCC
GAGGGACGCC TCGTGGCCTC CGACCGGGCG GCGCACGCGG CGCGCGCGGG GTTCGCCCCG
CACCTGCGCC CCGCGCCCGC CGCCCTGCGG GAGATGGACG TCCGTCGCCG CGACGCGAAC
TCCGCCCGGG CGCCGCTCCC CGCGCTGCGC GACCGGGCGC GCGCGCTCGA CGGCCTCGCC
GCCTCCCGCC CGGGCGGGGG CGAGGCGGCC GGGCGGGCGC TGCTCGCCGC CGTGCCCCCG
GCCTCGTTCC GCCTCGCCGT GCTCCTCGTG GAGTTCCCGG ACGTGCCCCA CACGTACGGC
GCGGACGCCT TCCGCGCGCT GTTCTTCTCG GAAGGCACGT ACCGCAAGTC GCCGCTCGGC
TACGACGTGA CGGGCAGCCT CCGCGAGTAC TGGGCGGAGG TGTCGTACGG TCAGCTCTCG
GTCACGGGGG AGGTGTTCGA CTGGGTGCGC GCGGCGGCGC CGCGATCGGA GTACCTGAAC
GACTCGTGGC GCCTCCGCGA CGAGGCCATC GCGGGCTCGG GCGTCGATCT GTCGCAGTTC
GACGGCTACG CGCTCGTCTA TGCCGGCGAG GTCCAGTCGA GCGCGCTCTG GCCGAACGCG
CCGGGGAACT GGTACGTGAT GTCCGAGACG TTCTACTCGA GCACCGCGCT CGGCGTGGAC
CTCGCCGGCG TCGGCACCCA CTGTCACGAG TTCGGCCACG TGCTGGGCCT GCCCGACCTC
TACTACGGCG CCGGGAACAT CGGCACCTGG GGGCTCATGG GCAGCGGGAA CTACCTCGAC
GCGGGCCGGA CCCCGCCGCA CGTCGGGGCG TGGGAGAAGA AGCGCCTGGG CTGGCTCCGG
CCGGAGCGGC TCGCCGGCGG CTTCGCGGGT CCGCTGCGGC TCCCGCCGGT CGCCTCGGCG
CCCGCCGCGT TCGAGGTCCT GACCGAGCGC TCCACGTTCC TGCTGGAGAA CCGGCAGTGG
GTGGGCTTCG ACACCCACCT GCCGGGCCAC GGCATGCTCG TGTGGCACGT GGACGAGACC
CAGGCCAGCT ACACCACCGG CGACCACTGG ATCCTGGATC TCGTCGAGGC GGACGGGATT
CCCGCCTCCA CCTCGGCGGG CGGGACGCCG TTCCCGGGCG GCCGGACCGA GGGCGCGCTG
ACGTGCGCCA CGACTCCGAG CAGCGCGGAC TACGACGGGA GCTGCTCGTT CGAGCTCACC
GGCATCGTGG AGGACGGCGA CGACGTGCAC GCGGACGCGG TGGTCTCGTG GCGGAGGGGC
GTCGGGATCA CCGTGAACGG CACGGGCGAC CACCCCACCC TCTTCACGGC CTTGGCCATG
GCGCCCGCGG GGTCGACGGT GCGGGTGCCG GCAGGCACGT TCCGGGAGCT CGTGCGCCTT
CCGGACGCGG TGTCGCTCGC AGGCGCGGGG CCGGGGCAAT CGATCCTGGA GGCACTCGAC
GCCCGCCCCC TCGTGATGCC GGGCGAGGAC AGCGCCGTGT CCGGATTCAC CCTTCGTGCC
GCCGCAGGCG CCGACGCGTT CGGAGGCGAC GCCTTGCAGC AGGGCTACTC CACGGCGTTC
ACGGTGACGA ACTCGGCGTT CGTCGGGTTC TACACCGCCA TCGCGCTCTG GGAGGGATGG
CCTCCGAGCT GGAACCGCGA CTCGCGCGTG GCGCGGGTGA CGAACAACGT GTTCGACGGG
AACACGTACG GGCTGATGAT CACGTACTAC GACGGGTTCG TCCCGACGAT CCGCAACAAC
GCCTTCTTCC GGAACGACTA CGCCGTCCTC GCGAGCACGT CCTCCACCGG TGACCTCGGC
TACAACGCGT TCGCGCGCAA CGGCGTCGAC ATCGCCGCGG TCGACGGCTC CGGTCCCGGC
TCGAGCGACG TGCTCGCCGA TCCGGCGTTC GTGGACCCCG ACGCGGGCGA CTACCGGCTC
TCGCCCGGCT CGGCCTGCAT CGACGCCGGC GATCCTGCCG CGTCGTACAG CGACGTGGAC
GGCACGCGCG CGGACATCGG CGCCTTCGGC GGCCCCGGAG GGGCGCCGCC GAGGCTCGTC
GTGACGCGCT TCGGGAGCGG GTCCGGCCGC GTCGTCTCCG CTCCGCCGGG GATCGACTGC
GGGAGCACGT GCTCGGCCAC GTACGCGTCG GCGATGGACG TCACCCTCAC CGCCGAGCCC
GATCCCGGCT CGCGGTTCGA CGGCTGGGCG GGCGCGTGCG TGGGCACGGG CGACTGTGCT
CTGCGGGTGA GCGGCGAGGT CGCGGTGACG GCGCGCTTCA CCTCGACGGC GTGCCCGGCC
GCCGACGATT GCCACGAGGC GGTGGTGGAC GCGAGCGGCG CGTGCGTCTA CCCGGCGAAG
CCGGACGGCG CCCCGTGCGA CGACGGCGAC GCCTGCACCC GGGCCGACGC CTGCGCGGCC
GGGACGTGCT TCGGCTCCGA CCCGCTCGCC TGCGACTCGC CCGACCCGTG CGCCGCGTCG
ACGTGCAGCG CCCCGAGCGG AACCTGCCTC GCGGTCCCGG TGCCGGACGG CAGCGCCTGC
GAGGACAGGG ACCCCTGCAC CATCGGAGAG ACCTGCGCCG GCGGCGCGTG CGGGGGAGGA
GCGCCGGTCA CGTGCGAGCC CGCGGGCGAA TGCCACACGG CGGCGGCGTG CGAGATCGCG
GCGGGAGGCT GCACGTCCTC TCCGCTGCCG GACGGGACGC CCTGCTCGAT CGGCCTGTGC
ATCGGCGGGG TGTGCACGCG CGATCCGACC GGGATCTCCG AGCCTCCGCC GCCTGCCGGC
GGCGGCGCAC CGCCGCGAGG GGGCGGTGGC TGCGCGTCCG CAGGCGGCGG CGGCGAGCTC
GCGGGGCTCC TCGCGGCGCT CGGCGCCCTC GTCCGCGCCA GGCTCCGCGC ACCCATCGCC
GGATCCGCTC GCAAGGCCGG CGGTCAGTCC TGGTCGCCCG CGACCGCGGC GAGGAGCGCC
TCGCGCCGGC GCTCGAGCAG GCGCGCCGCG AACGGGAGCG CGACGCGGTA G
 
Protein sequence
MPSSLRVVTR RVAALASALL SAAAAAAPAP PGAFSVRQPD GSPLAVRAYG DEHGIVFETL 
DGYALAQTAT GEWRLARLAP EGRLVASDRA AHAARAGFAP HLRPAPAALR EMDVRRRDAN
SARAPLPALR DRARALDGLA ASRPGGGEAA GRALLAAVPP ASFRLAVLLV EFPDVPHTYG
ADAFRALFFS EGTYRKSPLG YDVTGSLREY WAEVSYGQLS VTGEVFDWVR AAAPRSEYLN
DSWRLRDEAI AGSGVDLSQF DGYALVYAGE VQSSALWPNA PGNWYVMSET FYSSTALGVD
LAGVGTHCHE FGHVLGLPDL YYGAGNIGTW GLMGSGNYLD AGRTPPHVGA WEKKRLGWLR
PERLAGGFAG PLRLPPVASA PAAFEVLTER STFLLENRQW VGFDTHLPGH GMLVWHVDET
QASYTTGDHW ILDLVEADGI PASTSAGGTP FPGGRTEGAL TCATTPSSAD YDGSCSFELT
GIVEDGDDVH ADAVVSWRRG VGITVNGTGD HPTLFTALAM APAGSTVRVP AGTFRELVRL
PDAVSLAGAG PGQSILEALD ARPLVMPGED SAVSGFTLRA AAGADAFGGD ALQQGYSTAF
TVTNSAFVGF YTAIALWEGW PPSWNRDSRV ARVTNNVFDG NTYGLMITYY DGFVPTIRNN
AFFRNDYAVL ASTSSTGDLG YNAFARNGVD IAAVDGSGPG SSDVLADPAF VDPDAGDYRL
SPGSACIDAG DPAASYSDVD GTRADIGAFG GPGGAPPRLV VTRFGSGSGR VVSAPPGIDC
GSTCSATYAS AMDVTLTAEP DPGSRFDGWA GACVGTGDCA LRVSGEVAVT ARFTSTACPA
ADDCHEAVVD ASGACVYPAK PDGAPCDDGD ACTRADACAA GTCFGSDPLA CDSPDPCAAS
TCSAPSGTCL AVPVPDGSAC EDRDPCTIGE TCAGGACGGG APVTCEPAGE CHTAAACEIA
AGGCTSSPLP DGTPCSIGLC IGGVCTRDPT GISEPPPPAG GGAPPRGGGG CASAGGGGEL
AGLLAALGAL VRARLRAPIA GSARKAGGQS WSPATAARSA SRRRSSRRAA NGSATR