Gene Anae109_3694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3694 
Symbol 
ID5376711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4305543 
End bp4307459 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content80% 
IMG OID640845215 
ProductTPR repeat-containing protein 
Protein accessionYP_001380857 
Protein GI153006532 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.663272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACCG TCGCGCGTCA CGTCCTCGTC CTCGCACTGC TCGCCGCAGC GGGCGCCCCG 
CGCGCTGCGC CCCTCACGCG GCCCGGCAGG AGCGCGGACG GGGCCGGCTC GAAGCGCTGG
GCCTCGGCGC GCGCCATCGC CGGCTACCTC GAGGCGCATC GGCGTGCGCG GGCGGGTGAC
CCGCGCGGCG CCGTCGACGC CCTCAGGCTG GCGGTCGCGC ACGACGGGAC GAGCCCCGAG
CTGCGCGTCT CGCTCGCCGA GGCGCTGCTC GAGCTGGACC GCTTCGACGC GGCGGAGGCC
GAGGCGCGCA AGGCCCTGGA GCTCGCGGGC GGCGCCGGCC GGACGGCCTC GGAGGCGCAC
GTGCTCCTCG CCCGGCTCGC GGCGGGGCGC GATCGCATCG AGGAGGCGAC GCTCGAGCTC
CGCCGCGCCG TTCGCCTCGA GGCGGACCTC GCCGCGCGGG GCGAGCGCGC GGACGCGCTG
CCGTGGAGGC TCCTCGCGGA CCTCTACCTC GACCTGGGGG ACGAGGCCGC CGCGGCGCGG
ACGCTGGAGG ATCTCGGCCC GCACGCGCCG GCCGAGGCCG CCGCCGGGCT GCGGGAGCTG
GGGCGCGCGC TGCTCGATCG CGGTCAGCCG GGCCGCGCCG AGCACGGCCT GCGGCGCGCC
GCGGAGCTGG ATCCGGCCGA GGTGGAGGCG CTCCGCCTGC TCGCGGCGGC GCACGAGGCG
CTCGGGCGTG ACGGCGAGGC GCGCGACGAC CACCTCGCGG TCCTGCGGCG AGAGCCGGAC
GACGCCGCCT CGCTCGTCGC CCTCGGTCAC CTCGCCGCGC AGGCGGGCGA CGCCGAGCGG
GCGCGGGAGT GGTTCCGCCG CCACGCGCGC GCCGCGGGGG ACCGGTCCGA GGCGCACCTG
CGCATCGCGT TCGAGTGGCT CGAGGCCGGC CGGCCCGCCG ACGCGCTCGC CGCGGCGCGC
CAGGGGCTCG GGGAGGTGGG CCCCGACGCG CGGCTCCGCT TCGCGGAGGG GCTGGCGCTG
CGCGAGCTGC GCCGCTACCC GGAGGCCGCG ACGGCCCTGC AGGCCGTCCC CCCGAGCGCG
GACGTCTACT GGATCCCGGC GCGGGTGGCC CTCGCCGACG CGCTCTCCCG CGCCGGTCGT
CACGCCGAGG CGGAGCGCGC GCTGGCGGCG CCCCTCGCCG ACTTCCCGAA GGACGTCCGG
CTCGTCCTCG CGCGCGCCGC GGCGCTCTCC CGCGCCGGCC GGCGCGCCGA GGCGGTCGCT
CTCCTGAGGG GGGCGGCGAG CGAGAAGACC CGGGCGAAGG CGGAGGTGGC CGAGCTCACG
GCCGCGCTCG CCGACGCGCT CGTGAGGGCG GGGAAGGCGG CGGAGGCGGT GTCGGCGCTG
CGCAGCGCGC TGGCCTCGGA TCCCCGCGAC CAGGCGCTGC TCTACGCGCT GGGCGCCACG
TACCACCGGG CCGGGCAGCT CGACGCGGCC GTCGCACAGA TGCAGGCGCT CCTCGCGCTC
GTCCCGGATC ACGCCGAGGC GCTGAACTTC ATGGGCTACG CGCTCGCCGA GCGCGGCACC
CGGCTGGACG AGGCGGAGCG GCTCGTCCGG CGCGCGGTCG AGCTGAGACC CCGCTCCGGT
CACGTCCGCG ACTCGCTCGG GTGGGTGCTC TTCCGGCGCG GCGAGTACGC GCGCGCGGCG
GAGGCCCTCG AGCAGGCCGA CGCGCTCGCC GGACCCGACG CCGTGATCCT GGAGCACCTC
GGGGACGCCT ACCGCGCCCT CGCGCGGACC GCCGACGCGG CGCAGGCGTA CCGGCGCGCC
CTGGGCGCGG GAGAAGACGG CGGTGAGGAC GCCGACGACG GCCCGAGGCG GAGAGCGGGG
ATCGAGCGCA AGCTGCGCGA GCTCGGCGCG ACGGCCCGAT CCCGCGCGAC CCCTTGA
 
Protein sequence
MRTVARHVLV LALLAAAGAP RAAPLTRPGR SADGAGSKRW ASARAIAGYL EAHRRARAGD 
PRGAVDALRL AVAHDGTSPE LRVSLAEALL ELDRFDAAEA EARKALELAG GAGRTASEAH
VLLARLAAGR DRIEEATLEL RRAVRLEADL AARGERADAL PWRLLADLYL DLGDEAAAAR
TLEDLGPHAP AEAAAGLREL GRALLDRGQP GRAEHGLRRA AELDPAEVEA LRLLAAAHEA
LGRDGEARDD HLAVLRREPD DAASLVALGH LAAQAGDAER AREWFRRHAR AAGDRSEAHL
RIAFEWLEAG RPADALAAAR QGLGEVGPDA RLRFAEGLAL RELRRYPEAA TALQAVPPSA
DVYWIPARVA LADALSRAGR HAEAERALAA PLADFPKDVR LVLARAAALS RAGRRAEAVA
LLRGAASEKT RAKAEVAELT AALADALVRA GKAAEAVSAL RSALASDPRD QALLYALGAT
YHRAGQLDAA VAQMQALLAL VPDHAEALNF MGYALAERGT RLDEAERLVR RAVELRPRSG
HVRDSLGWVL FRRGEYARAA EALEQADALA GPDAVILEHL GDAYRALART ADAAQAYRRA
LGAGEDGGED ADDGPRRRAG IERKLRELGA TARSRATP