Gene Anae109_3756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_3756 
Symbol 
ID5375897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4379149 
End bp4380699 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content71% 
IMG OID640845278 
Producttetratricopeptide TPR_4 
Protein accessionYP_001380919 
Protein GI153006594 
COG category[S] Function unknown 
COG ID[COG1729] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0867818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGA AGACTCGGAT CTCGGTCGTC CTGCTCGCCC TCACCGCGGC CTGTGCAACG 
GGCGGCGGCG GGAAGGGGCC CAGCGAGCGG TTCTACCAGG CGACCTACAA GCTCCCCGCC
CCCAGCCAGC TCGAGGACGC CGAGCGCGGG AAGATCAAGG ACGCGGGGAC GCACTACGAT
CGCGGCCTCG TGGCGCAGCA GTCCGGCAAC ATCGACCAGG CGCGCGCCGA GTGGGCCACT
GCCGCCCAGG GCTACGCCGA CTTCGCCGAT CAGTTCCAGT CGTCCGAGTG GCGCCTCCCG
ATCCGCTTCC GCGCCGCCGA GCTCTACATG CAGGCGCAGC AGTTCGAGCG CGCCGCGGAG
CAGGCCCAGA AGGTGGTGGC CGATCCCCAG TCGGACGCCT CGTCGAAGGC CGTCGGCTCG
CGGCTCGCCG CCGGCGCGTG GCTCAACGTC GCGAACCAGA AGGTGAAGGC GAGCCAGCTC
GAGCCGATCC GGCTCGCGAA CGCCGACCAG CGCCGAGGGC AGCCCCTGCA GCCGCGCGTC
CCGCCGGGAG AGTGGAAGCG CTTCGTCGAC TCCGCGGACG TGTACCTCCA GAACCTCGAG
GCGGACCCGG AGACGAAGAA GCCGGCCGCC GAGCGCCGCG GCGGCCTGCC GCCCGCGCAG
CTCGCGCTCA TCGCCGCCGA GGTCGAGTAC GCGTTCGACA ACATGGAGGA CGCCCGCCGC
CGCTTCGCGG ACATCCTGAG CCGCTGGCCG GAAGAGGGGG AGGTGCTGGA GAGCGCGGTG
CCGCTCTACC TCCAGACCTT CCTGTTCGCG AACGACGACC AGGGCTACCA GGCCGAGGTC
GCCCGCATCC GCGAGCAGGT GCAGGCCCAG GCGCAGAAGG CGACGGACCC GAAGCAGAAG
GAGAGCTACG ACAAGGTCCT CGAGGCGCTC TCCCGCGCCG AGGCGGGTAC CCACTTCGCG
GCCGCCCAGA AGCTGCTCGA CGAGGGCAAG CCCGCCGAGG CCGCCCAGGC CTTCGAGAAG
CTCGCGGCCG ATCCGCGCGG CGGCGACGCG GCGAACGCGC TCCACAACGC CGCGGTGGCC
TGGGACAAGG CCGGCAAGGC GGATCGCGCC GCCGAGATCC GCGAGCGGAT CCTGAAGGAG
CACGCGGACA GCAAGGTCGC GGGGAACAAC ATGCTGCTCC TCGCCGTCAA CAAGTCGAAG
AAGAACGACC ACTCGGGGGC GGCCAAGCTG TACGACGACT TCATCGCGAG GTACCCGGAC
TCGCCGAACC GGTGCGTGGC CCTCCAGAAC GTCGCCTCCG AGCTCGACCT CGCGAAGAAG
GCGGCGCCGG CCGCGGAGCG GTACGTCACC TTCGGCAAGG ACGAGAAGTG CGCGAGCGCC
GACCCGAACG TCGCCGCCCG CGCGCTGTAC CGGGCCGGCC GCCTCTACGA GGACGCGAAG
CAGAAGGCGA AGGCCAAGGA GGCCTACGCC GCGGCGATCG CGCTCCCGGG GGTGACCGAC
ACGGTCGCGA AGAGCCAGCT CGACGACGCC AAGCGCCGGA TGAAGAAGTA G
 
Protein sequence
MTSKTRISVV LLALTAACAT GGGGKGPSER FYQATYKLPA PSQLEDAERG KIKDAGTHYD 
RGLVAQQSGN IDQARAEWAT AAQGYADFAD QFQSSEWRLP IRFRAAELYM QAQQFERAAE
QAQKVVADPQ SDASSKAVGS RLAAGAWLNV ANQKVKASQL EPIRLANADQ RRGQPLQPRV
PPGEWKRFVD SADVYLQNLE ADPETKKPAA ERRGGLPPAQ LALIAAEVEY AFDNMEDARR
RFADILSRWP EEGEVLESAV PLYLQTFLFA NDDQGYQAEV ARIREQVQAQ AQKATDPKQK
ESYDKVLEAL SRAEAGTHFA AAQKLLDEGK PAEAAQAFEK LAADPRGGDA ANALHNAAVA
WDKAGKADRA AEIRERILKE HADSKVAGNN MLLLAVNKSK KNDHSGAAKL YDDFIARYPD
SPNRCVALQN VASELDLAKK AAPAAERYVT FGKDEKCASA DPNVAARALY RAGRLYEDAK
QKAKAKEAYA AAIALPGVTD TVAKSQLDDA KRRMKK