Gene Anae109_3930 details


Gene Information

Locus tag: Anae109_3930
Symbol:
ID: 5377012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4588694
End bp: 4591618
Gene Length: 2925 bp
Protein Length: 974 aa
Translation table: 11
GC content: 79%
IMG OID: 640845455
Product: TPR repeat-containing protein
Protein accession: YP_001381093
Protein GI: 153006768
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
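
The coordinate and length fields in this record are related by simple arithmetic, and a short Python sketch can check that they agree. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. Neither assumption is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4588694
end_bp = 4591618


def gene_length(start, end):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values match the Gene Length (2925 bp) and Protein Length (974 aa) fields.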


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCCT CCATCGTCGA GAAATACGAG CAGATCCTCG CAGCCGATCC CCGCTCGCGC 
ATCTTCGTCG AGCTCGCGAA GGCGCTCGTG GAGCGCGGCG ACGCGGCCCG CGCCGTCGAG
GTGTGCCGCC GCGGGCTCGA GCACCACCCG TCGTCGATCC TGGGCCGGGT GGTCTGGGGC
CGCGCGCTCC TCGACACGGG CGACGCGGGC GGCGCGGTCG CGCAGTTCGA GGCAGCGGTC
GCCGTCGATC CGGCGAGCCC GTACGCCTTC AACCTCGTCG GCGAGGCGCT CGTCGCGAAG
AAGCACTTCG CCCAGGCGCT GCCGTACCTC GAGCGGGCCG TCGAGCTGCA GCCCGCGGAC
GCGCGCGTGC GCGGCTGGCT CGACCAGGCA CGCCGGCAGC TCGGCGTGGG AGCCGTGACG
CCGCAGCCGG GCCCGAAGCA GGCGCCCGAG CAGGAAGACT CGGAGACGAC GATCCAGGTC
GGGTTGAGGA GGCGCTCGAG CTCGGCCGCG ACGCCGGCGC CGACCTCGGC CCCCGCCCCG
GCCGCGACCG GGACCGCTAT CTCGACCGCG ACCTCGACCG CGATCCCCAC CGCGACCTCG
ACCTCGTCTC CTGAGACCGC TCATCCCGAG CCCTTCGACT CCGCGGCCTC CGGCCGCTCC
GCTCAGGACA GGCTGCGCGT CTCCGCGACA GCGGAGACGG GGACTCGAGG GGCCCCGCCC
GTGCTCTCGC GCGTGGCGCC GGCGGGCGAG ACCTCCGCCG CGGGTACCGT CCCGGGCGCG
GGAGCGACCG CTCCCATCAC CCCCCCGCCG CTCCGCGCCC CCGCGACGCC TCCCCCGCTC
TCGCCGGATC CGTCGCGCTC GGTGCTTTAC ATGATCCCGA CCGAGACGAC CCGGGACGTC
ATCGGTCCGT CCGCCGCCGG CCGCGTGGGC GCGCTGCGCG CGCCCGCGGG CGCGGCGCCC
GACGCCGCCG AGGCGGATCG GCTCGCGCAG CAGTTCGCCG ACGAGCTTCG GAAGAAGCTG
CTCGCGGAGA CGCCTCCGCC CGCTCCCTCA CGGCTTCGCC GTAGCCGCCG CCTGCTCGGC
GCCGCCGCGA TCGTGCTCGC CGTCGGCGCG GCGGTCGCCG TCTACCTCGT GGCGGACGCC
AAGCGCGCCA CCGCCCTCGC CGCCACCGCC GGCGTCCGGG CGCGCGCGGG GCTCGCCCGC
GACACGCTCG GCTCGCTGCG CGAGGCCGCC CGCCTCCTGG AGGAGGCGCG CCGCGCGTCC
GAGGATCCGG AGCTCGTCTC CCTCTCCGCC CAGGTCGCCG CGCTGCTCGC CGTCGAGCAC
GGCGACGAGG CCGCCCGCAA GCTCGCGGCG GAGCTCGCCC GCGATCCCCG CTCCGGCGAC
GGCGGCCGCG TCGTGGACTA CCTCCTCGCC GCGTCGCCCG CCGAGGCGAA GGACGCGGAG
GGGTCGCTCC TCGACGCGCG TCCGTCGTCC GCCCCGCTGG TCCAGGAGCT CGCGGGGCGC
GTGCTCGTCG GGCGCGGCGA GCTCGAGGCC GGCCGCGGTC GGCTCGAGAT CGCCGCGCGC
GCGAACCCGC CGCTCCTGCG CGCCCTCTCG GAGCTCGGCG ACCTCTCCCT CCGCGCGGGC
GACGCGGACC GCGCCCTCGC GCTCTACCGC GCCGCGCTCG GCGCGCACCC GACGCACCCG
CGCTCGGTCG TCGGCGCGGC CGAGGCCCGG CTCGCGACCC GCCGCGACCT GGCGGACGCC
GCCCGCGAGC TCGCCGCGGT GGAGGCCGAT CCCGCGAGCG CCCCGCCGCA GGATCTCCGG
CTCCGCTTCG AGCTGGCCGC TGCCCGCGTC GCGCTCGCGC GAGGCGAGCC GGCCGAGGCC
GCGGCGCGGC TCGCGCGCGC GACGGAGTCG CTCGGGCAGA CCGGCACGCT CGCCGCGGCG
CTCGCCGAGG CCCACCTCGC CGCCCGCGAC TGGGACCGCG CGGAGGCCGC CGCCGCGCGC
GCCGTGCAGC TCGAGCCGCG CGAGGTCTCG CATCGGCTCC TGCTCGCCCG CGCGCAGGTC
GGCAGGGGCC GGCCCGCCGC CGCCCTCGCC GCGCTCGCCC CGGTCGACGG CCGCGCGGTC
CGGATCCAGC GCGCCATCGC GAGGCTCCGG CTCGGCCAGC CCGCGGCGGC CCGGCGCGAG
CTGGAGGCGA CGGGGCGAGA CGGCCGTATG CCGGCCGAGG CCGCGGTCTG GTACGCCCTC
GCCGACGTCG CGCTGGGCCG CGCCCCCCGC GCCCGCGCCC TGCTCGAGAA GCTCTCGCTG
GCACGCACCC CGCCCCCGCT CGTGCACGCC GCCCTCGGCC GGGCGCTCGA GGCCGAGGGC
AACCTCGCCG GCGCCGAGCG CGCCTATCGC ACCGCCGCCG AGCGCGAGCC CGACGCGCCG
GAGGGGCTGG CCGGGCTCGG CCGCGTGCTG GTCGCGCGGG GGCACGCCAA GGACGCCATC
GCGCCGCTCG AGGACGCGGT GCGGATGGAT CCCGCCGACC TCGAGGCGCG CCGCGCGCTC
GGGGAGGCCC GGCTCGTGGC GGGGCTCGCA GGCTCCGCGC GCGCGGAGCT CGACGCCGTG
CTCCTCGCGC GGCCGAACGA CGTCCCGGCC CTCACCCTCC TCTCGGCCGC GTGGCTCGCC
GAGGGCGAGG CCTCCGAGGC GCGGCGCGCC GCCGAGCGCG CCGTCGCCGC CGGACCGCGC
GACGGGCGCG CGCTCCTCGC CGCGGCGCGC GCCGCCCACG TCGCGGGCGA TCTCGCCGCC
GCGAAGCCCC TCGCCGCGCG CGCGCTCAAG GCAGGGCTGC GCGGCGACGA CACGGTCGAG
GCGAAGAAGA TCCTGACGTT CGCCGGCAAG CAGGTCGCGA TGAACGCCCC CTCGTCGGCG
AAGAAGGTCC CCGCGGCGGT CAGCAAGCCG GCGCGGCGCC GCTAG
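
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing anything from it. The following Python sketch shows one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the record does not say how its own GC figure was rounded.

```python
def clean_sequence(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGCCCCCCT CCATCGTCGA GAAATACGAG CAGATCCTCG CAGCCGATCC CCGCTCGCGC"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

To check the full gene, paste every line of the listing into one string and pass it to `gc_percent`.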
 
Protein sequence
MPPSIVEKYE QILAADPRSR IFVELAKALV ERGDAARAVE VCRRGLEHHP SSILGRVVWG 
RALLDTGDAG GAVAQFEAAV AVDPASPYAF NLVGEALVAK KHFAQALPYL ERAVELQPAD
ARVRGWLDQA RRQLGVGAVT PQPGPKQAPE QEDSETTIQV GLRRRSSSAA TPAPTSAPAP
AATGTAISTA TSTAIPTATS TSSPETAHPE PFDSAASGRS AQDRLRVSAT AETGTRGAPP
VLSRVAPAGE TSAAGTVPGA GATAPITPPP LRAPATPPPL SPDPSRSVLY MIPTETTRDV
IGPSAAGRVG ALRAPAGAAP DAAEADRLAQ QFADELRKKL LAETPPPAPS RLRRSRRLLG
AAAIVLAVGA AVAVYLVADA KRATALAATA GVRARAGLAR DTLGSLREAA RLLEEARRAS
EDPELVSLSA QVAALLAVEH GDEAARKLAA ELARDPRSGD GGRVVDYLLA ASPAEAKDAE
GSLLDARPSS APLVQELAGR VLVGRGELEA GRGRLEIAAR ANPPLLRALS ELGDLSLRAG
DADRALALYR AALGAHPTHP RSVVGAAEAR LATRRDLADA ARELAAVEAD PASAPPQDLR
LRFELAAARV ALARGEPAEA AARLARATES LGQTGTLAAA LAEAHLAARD WDRAEAAAAR
AVQLEPREVS HRLLLARAQV GRGRPAAALA ALAPVDGRAV RIQRAIARLR LGQPAAARRE
LEATGRDGRM PAEAAVWYAL ADVALGRAPR ARALLEKLSL ARTPPPLVHA ALGRALEAEG
NLAGAERAYR TAAEREPDAP EGLAGLGRVL VARGHAKDAI APLEDAVRMD PADLEARRAL
GEARLVAGLA GSARAELDAV LLARPNDVPA LTLLSAAWLA EGEASEARRA AERAVAAGPR
DGRALLAAAR AAHVAGDLAA AKPLAARALK AGLRGDDTVE AKKILTFAGK QVAMNAPSSA
KKVPAAVSKP ARRR