Gene Anae109_3678 details


Gene Information

Locus tag: Anae109_3678
Symbol:
ID: 5374750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4288106
End bp: 4289653
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 68%
IMG OID: 640845199
Product: transposase IS66
Protein accession: YP_001380842
Protein GI: 153006517
COG category:
COG ID:
TIGRFAM ID:
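
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS is complete and ends in a single stop codon (the record itself does not state either point).

```python
# Values copied from the Gene Information fields above.
START_BP = 4288106
END_BP = 4289653
GENE_LENGTH_BP = 1548
PROTEIN_LENGTH_AA = 515


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)              # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions, 4289653 - 4288106 + 1 = 1548 bp, and 1548 / 3 - 1 = 515 aa, which agrees with the listed Gene Length and Protein Length.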


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAGCG GCGATCACGA CTGCGAGTGG CGCGAGCGGG CCGAGAGCCT TGCCGTTCAG 
CTCGACGCCG CGCAGAAGAC CATCGCCTCC CAGACGCAGA CGATCCAGGC CCAGAGCGAG
TCGCTCACGA AGCTGAGCGA GCAGTTCGCA TCGGTAAAGG GCACCGTCGA GAAGCTCCAG
CGGCACGTCT TCGGGAAGCG CTCCGAGAAG ATTACGCCGC TCGCGACGGC GCTCCGTGAC
CCCGCGCGCG CCGAGGCCGA CCGCATCGCG GCGCTGCAGA CGCGGCGGGA GAACGCGGAG
AAGAAGCGCC AACTCGTCAC CCGGAAGATC GAGCACAAGG TCCGCGAGGA CCAGAAGGTC
TGTCCGAAGT GCGGAGGCCG AGACTTCTCA AAGCTCGGCG ACGGGGCCAT GAGCGAGCTG
TACGAGCTCG TTCCGGCGAT CGTGGAGCGA CAGCTCCACA TCCAGGAGAA GTTGCGCTGC
AAGTGCGGCG AGACCGTCAT CACCGCGGAC GGGCCGACGA AGGTCTTCGA CAAGGCCCGC
GTCGGACCCA CCTTCATGGC GCAGGTCGCG GTGTCGAAGT GCGCCGACGC GTTGCCCCTT
CATCGCCAGG CAAAGGCGTA CAGGCGCGTC GGCGTACAGG TGAACGACTC GACGCTCGGC
GACTACTTCC ACCGCACGGC CGAGATCACC AAACCCATCT CCGACCGGCT GCTCGACGTG
GTCGCGGAGA AGGAGATCGT CCTCGCCGAC GAGACGTCGC ACCGCGTGCA GGCGAAGGGG
AAGACGCGCC GCTCGTGGCT CTGGAGCTTC ATCGCAAAGG ACGAGGACGA GCGCGAGATG
ATCGCGTACG TCTTCTCGCC GAGCCGCTCG AGCGAGACGC CCGAGCGCGT GCTGGAGGGC
ACCTCCGGCA AGCTCGTCGC CGACGCGTAC AAGGGCTACG ACCGCGTGAC CATGCCCGGC
CGGCGTGTCC GCGCCGGCTG CCTCGCGCAC GTCCGCCGGA AGTTCTTCGA TGCCCAGTCC
GTCGCGCCCG ATGCCGCGAA GCAGGCGATG GACTTCATCC TCGAGGTCTA CAAGATCGAG
CGGGCCGCGC TCGACGCCGA TCTCCTCGGC ACGCCCGAGC ACCTGGAGAT GCGCCAGAGC
GCGAGCCGCG CGGTGATGGA CGACTTCAAG GCGTGGCTCG ACGCGGAGCA GCCACGGCAT
CCGCCGAAGG GGCCGATGGG CGAGGCGATC AACTACGCGC TCGGCCAGTG GGACGCGCTG
ACGCTCTTCC TCACCGATCC GCACCTGCCC ATCGACAACA ACGCCTCCGA GCGCGCGCTC
CGCGTGGCCG CGCTCGGACG GAAGAACTTC CTCTTCGTCG GCACGAACGA AGCCGGCGAG
AACCTCGCCG GCCTGTACTC GCTCATCGCG ACCTGCGAAG CGAACGGCGT GAACCCAGTC
GACTACCTCG CCGACGTGCT CATCCGCGTG CAGACGCACC CGGCCTCGCA GATCGACGAG
CTGCTGCCGC ACAACTGGAC GCCCCCACCG ATACGCGCGC CCTCGTGA
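
The GC content field can be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. This is a minimal sketch with a hypothetical helper name. It assumes the reported percentage is simply the fraction of G and C bases over the whole gene; the record does not say how the figure was computed or rounded.

```python
def gc_fraction(seq_block: str) -> float:
    """Fraction of G/C bases in a sequence; spaces and newlines are ignored."""
    bases = "".join(seq_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Usage: paste the full gene sequence between the triple quotes.
# Only the first row is shown here to keep the example short.
first_row = """
ATGGGGAGCG GCGATCACGA CTGCGAGTGG CGCGAGCGGG CCGAGAGCCT TGCCGTTCAG
"""
print(f"{gc_fraction(first_row):.0%}")
```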
 
Protein sequence
MGSGDHDCEW RERAESLAVQ LDAAQKTIAS QTQTIQAQSE SLTKLSEQFA SVKGTVEKLQ 
RHVFGKRSEK ITPLATALRD PARAEADRIA ALQTRRENAE KKRQLVTRKI EHKVREDQKV
CPKCGGRDFS KLGDGAMSEL YELVPAIVER QLHIQEKLRC KCGETVITAD GPTKVFDKAR
VGPTFMAQVA VSKCADALPL HRQAKAYRRV GVQVNDSTLG DYFHRTAEIT KPISDRLLDV
VAEKEIVLAD ETSHRVQAKG KTRRSWLWSF IAKDEDEREM IAYVFSPSRS SETPERVLEG
TSGKLVADAY KGYDRVTMPG RRVRAGCLAH VRRKFFDAQS VAPDAAKQAM DFILEVYKIE
RAALDADLLG TPEHLEMRQS ASRAVMDDFK AWLDAEQPRH PPKGPMGEAI NYALGQWDAL
TLFLTDPHLP IDNNASERAL RVAALGRKNF LFVGTNEAGE NLAGLYSLIA TCEANGVNPV
DYLADVLIRV QTHPASQIDE LLPHNWTPPP IRAPS
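
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a hand-rolled illustration, not the database's own pipeline. The names `CODON_TABLE` and `translate` are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code; the alternative start codons that table 11 allows do not matter here because this gene starts with ATG.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq_block: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the blocked layout above can be pasted directly.
    Trailing bases that do not form a full codon are dropped.
    """
    dna = "".join(seq_block.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGGGAGCG GCGATCACGA CTGCGAGTGG"))  # MGSGDHDCEW
```

The final nine bases of the gene, CCCTCGTGA, translate to PS followed by a stop, consistent with the protein ending in ...IRAPS.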