Gene Anae109_2067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2067 
Symbol 
ID5373919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2340626 
End bp2342386 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content73% 
IMG OID640843580 
ProductTPR repeat-containing protein 
Protein accessionYP_001379254 
Protein GI153004929 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0230242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCTT CGAGCTCGAG CACGACAGCG GCCGGGCGCC CCTGGTCGAG GTTCCGCTCC 
GTGGCGATGG CCGCCGCGAT CGCGACGTTC GCGCTGGCCA TCTACGCCGG CGCGCTGCGC
AACGGGTTCG TGTACGACGA CATCCCACAG GTCGTGCAGA ATCCCTGGAT CCGCGACGCC
TCGAGCCTCC TCGCGGCGTT CACGTCCGAC GCCTGGGGGT ACCTGGGGAT CCACAGCAAC
TACTACCGGC CGATGATGCA CGTCGTCTAC GCGGCGGCGA ACGCGCTCTT CGGGCTGGAT
GCCCGAGGGT TTCACCTCCT CAACCTCCTG CTGCACGCGG CAGTCTCCGT GCTGGTCTAC
GCCACGTCAC TCCTGGTGCT GCGTGCGGCG CCCGGAGCCT CGCCGGCGCG CTCGCGCGTC
CTCGCCGCGG CGTCCGGGTT CCTGTTCGCG GCCCACCCGA TCCACACCGA GGTGGTGAGC
TGGATCTCCG CGACGACCGA TCTCTGCGTC GCCCTGCTCG CGCTGTCCGC GCTGCACGCG
TACGCCACGC TCCCGCCCGA GCGCGTCCCT TCCGCCTCTC CCCGCTACCT GTGGGCGGTC
ATCGCGTTCG CGATCGCGAC CCTCACCAAG GAGGTCGCCC TCGTCATCCC GGGGATCCTC
GTCGCCTGGG ACGTCTCCTT CCGGCGGCAC GCGGTCGTTC GGGTGCGGTG GCTCGCGGCG
TACGCGCCGT TCGCCGGGGT GATCGGGCTC TACTTCCTGC TGCGCTGGAG CGCGCTGGGG
GGCTTCGCCT CCATCTCGCG GCATCAGGAG CTGACGACCC TGCAGCTTGC TCTCAACGTC
TGCGCGCTGT TCGGGGCCTA CCTCGCGAAG CTCGTCGTGC CGTCGGGCCT CTCCGCGTTC
CACCCCTTCG ACCCGGTGGT GTCGATCGCG GACCCGCGTG CGCTCGCGGG CCTCGTCGCC
CTGGCGCTGG TGGTGGCGTT CGTCTCGATC GCCTGGCGGA GGAGGAGCGG CGCCGTCCTC
GTTGCGCTGG CGGTGCTGCT GCTGCCGCTC CTCCCGAGCC TGTGGCTCAC CCGCCTCGGC
GAGAACCCGT TCGCCGAGCG CTACCTGTAC CTCCCGTCGC TCGGATTCAT CTGGCTCCTC
GCCATCGCGG GACAGCGGCT CGTCGCGGCC AGGCCCCGTC TCGCGCCTGC GCTCGGCGCC
GCGGCCGTCG TCCTGTGCGC GACGTGGGCC TGGGGCGTCG CCGCGCGCCA ACCCGCCTGG
CGCGACGACG TCTCGCTCTG GAGCGACGCG GCCGCGAAGG CGCCCGGCGC CGCGATCCCC
CGCTACAACC TCGCGGTCGC GCTGGAAGCT GCCGGGGACC TGCCACGCGC GATCGCCGAG
TACGAGACCG CGCTGCGGCT CGAGGAGAGC CCGGTCGCGT GGACGAGCCT CGGCGCGGCG
TACCACGCGG CCGGGCGTGA CGAGGACGCC CTCCGCGCCT ATGGACGCGC GCTCTACTGG
GACGCCGCGA ACGTCACGGC GCTGAACGGC CTCGGCGCGA CGTACGTGAA GATGGGCCGG
GGCGCGGCGG CGATCGAGCC GCTCCGCGCC GCCATCGGGT TGGCGCCTCG GTTCGCCCCG
GCGTACCACA ACCTCGGGCT CGCGTACGAG CAGCTGGGCG ATTCGCGGGC TGCGATCGAG
AGCTACCGCG CCGCGCTGGG CGCGGATCCG TCCAGCGCGG CGTCGTACCA GCGTCTGCGC
GCGCTCACCG CGAGCCCGTA G
 
Protein sequence
MIPSSSSTTA AGRPWSRFRS VAMAAAIATF ALAIYAGALR NGFVYDDIPQ VVQNPWIRDA 
SSLLAAFTSD AWGYLGIHSN YYRPMMHVVY AAANALFGLD ARGFHLLNLL LHAAVSVLVY
ATSLLVLRAA PGASPARSRV LAAASGFLFA AHPIHTEVVS WISATTDLCV ALLALSALHA
YATLPPERVP SASPRYLWAV IAFAIATLTK EVALVIPGIL VAWDVSFRRH AVVRVRWLAA
YAPFAGVIGL YFLLRWSALG GFASISRHQE LTTLQLALNV CALFGAYLAK LVVPSGLSAF
HPFDPVVSIA DPRALAGLVA LALVVAFVSI AWRRRSGAVL VALAVLLLPL LPSLWLTRLG
ENPFAERYLY LPSLGFIWLL AIAGQRLVAA RPRLAPALGA AAVVLCATWA WGVAARQPAW
RDDVSLWSDA AAKAPGAAIP RYNLAVALEA AGDLPRAIAE YETALRLEES PVAWTSLGAA
YHAAGRDEDA LRAYGRALYW DAANVTALNG LGATYVKMGR GAAAIEPLRA AIGLAPRFAP
AYHNLGLAYE QLGDSRAAIE SYRAALGADP SSAASYQRLR ALTASP