Gene Anae109_4144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_4144 
Symbol 
ID5376778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp4850989 
End bp4854363 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content80% 
IMG OID640845671 
ProductZinc finger-domain-containing protein 
Protein accessionYP_001381306 
Protein GI153006981 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID[TIGR02098] MJ0042 family finger-like domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000494225 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGGTCG GCTGCCCGCA CTGTCACGCC GCGTACAACA TCGACGACCG CCGCATCCCG 
GCGACGGGCC TGAACGTGCG CTGCCCGAAG TGCCGGGAGA CCTTTCCCGT GCGTCCCGCC
GACCCCGCCG GGCAGGGCGC TCCCGTCCCG CTCCCAACCG GCACCGGCCA GGCGTCCGGC
GCCGTTCCGC TCGCGCCCCC CTCCAGCGCG AGCGAAGGTC GCCCCGGCGC CGCGCCGGGC
GTTCCGCTCC CGCCGCCCGC GGCGCCGGCC TCTGGCGGCG TCCCGCTGCC GGCCCCCCGC
GCCGCTCCCG CCGTGGCCCC GGACCTCGTC ACCTCGGCCG GCATCCCGCT GCCGCCGCCC
ATCCCCGCAG GCAACGCATC CACCGACCCG TTCGCGGCCC CGCCCGGCGC GGATCCGTTC
GGGATGGACG TGCCGGACGC GACCGCCGAC GCCGTCGAGG CCGCGCCCGA GGGCGAGGCC
CTCGGGTTCG GCGAGGTCGA CCTCGGCGGG GGCGCGCCCG CGGCGCGCGC AGCTCCCTCG
CTGGACGACA CGGATCCGTT CGCGCCCGCG CCGGGAGCGA CGTCCTGGCC TCCTTCGAGC
GCCGCGCCGG CGGCGCAGCC GTTCGCCGCG GCGAGCGGCG CCGATCCCTT CGCGAAGCCC
GAGGACCCGG CGAGCGCGCC GCGGCGCGAG CCACCCGCCG CGGGCGAACC CCTCGAGACG
CTCTACGGCG AGGGGGCCGA GGCTCCGCCG GCGGACGAGG TGCGCTACCA GGTGCGGCTA
CGGTCGGGGA AGATCGTCGG GCCGCTCGGC GTGCCGCAGG TCATGGGGCT GCGCGTGCGC
GGCGAGCTCA CCGGCGACGA GGAGGTCTGC CGCGAGGGCG AGGACGACTG GTCGCCCATG
AGCGAGGTCG ATGCGCTCGC CGGTGCCGCC GCGGCGCCGG CGAGGCAGGG CGGCCGGGTC
GCGAAGCTCG GCCGCGGCGC CGGCCGGCCC CGCGCCTCCG GGGTGGCGGT GGCCGGAGCC
GCGCTCGCGA TCGTGCTCGC GGTGGGCGTC GGCGCCGGCT TCACGCCCCA CGGCTACTTC
TTCACCGCCG CGCTGCGCGG CAAGGACGGG GCGCGCACGG CGGCGCTCGT CGCCCAGGCG
CGCGCGGCGC TCGCGAAGGG CGACTACCCG TCGGAGCGCA CCGCGCTCGA TCTCGCGGCC
CGGGCGGTGG CGGCGGATCC GGACGCGCGC GACGCGGCGG CGCTGCACGC GATGGTGGTG
GCGGCGCTCG AGCTGCGGCA CGGCGCGCCG CCGGCGGCGC TGGAGCAGGC GCGCCGGGCC
GCGGACCGGC TCGCCGGCGG CGACGCTCCC GACGCGCCGG CGCTCGCCGC GCGGCTCGCG
CTGAGCGTGG CCGGGGGCGG CGGCGCCACC GCGCCGCAGG AGGCCGCGCT CGAGCGCTCG
GGCGGAGCGG CGGCGAGGGA CCCGGAGGTG GTCGCGCTGC TGGCGCGCGC GGCGCTGCTG
AGGGGCGACG CCGCGGCCGC CGCGGCGCGC TTCGAGGCGC TCGCGGCGCT CGAGCCCCAG
GGGGCGCGCG CCCTGCACGG CAAGGCGCTC GCCGCGCTCG CCCGCGGCGA CGCACCGGCG
GCGAAGGCCG CCTTCGAGGC CGCGCTCGCC CGGGACGGAG GTCACCTGCC GTCGCGGCTC
GGGCTCGCGA CCCTCTCCCA GGCCGCCGGC GACCCGGCGG GAACGGAGCG CCACCTCGCG
CCGCTCCTCG CCAAGGACGC GGAGGCGAAG CTCGCGCCGG GCGAGCGGGC GCGCGCGCTC
GCGCTGCGCG CGGAGCTCCT CGCGCGGAGC GCGGCGGGCG CCGCCGAGGC CGTTCGGACC
TGGGAGGCGG CGACGGCGAT CGATCCGCGG GCGACCGAGA TCCGCGTCGC GCTCGCGCGG
CACCGGCTCG CCCACGGCGA CGCCGGCGGG GCGGTCGCCG CGACCGAGCC CGTCGCCGCG
TCCGCCGCGA AGGACGCGGC GCTCGCGGCG GTGCGTGTGC GCGCCCTCGC GGGCGCCGGG
CGGGCGCTCG ACGCCCTCTC CCTGGCGGAC GTGGCGCTCG CCAGCGCGCC GGCGGACGCG
AACCTGTCGC TCGCGAAGGC GGCCGCGCTC GCCGGGGCGG GCCGCATCGA GGACGCGGCG
GAGCTCTACC GGGCGGTGGC GGCGAGGAGC CCCGAGGCGT GGGAGCCGCG GCTCGCGCTC
GGCCGGATCG CGCTCACGCG CCGGCAGCTG GACGCGGCGG CCGTGGAGCT CGAGGCGGCG
GTCGAGCGCG CGCCCCGCGT CGCCGCCGTG CACGTGGGGG TGGGCGATCT CCGGCTCGCC
CAGGGGAACG CCGCGGGCGC CGAGGCGGCC TTCCGCCAGG CGCTCGCGGT CGAGCCCGAG
AGCGCGGCCG CCGAGACCGG CCTCGCCCGG ATCGCCCTCG CCCGCGGGGA CGCGGCGGCC
GCGCGGGCGC GGCTCGATCG CGCCCTCGCC CTCGATCCCC GGGACGCGGA GGGGCACGTC
GCCTTCGGGA CCCTGCTCTG GTCGGCCGGC GATCTGGCGG GGGCGGAGAA GTCGCTGCAG
ACCGCGGTCG AGCTCCAGCC GCGCAACGCC ACCGCGCTCA TGCGGCTCGG CGCCGTGAAG
CTCGAGCGTG GCGACGTGGA CGGCGCCGTG CAGCGGCTCA CGGCGGCGGC CGGCGAGGCG
CCCCAGCTGG CGGAGGCGCA GCAGTGGCTC GGGCGCGCGC TGCTGGCCAG GGGAGAGACC
CCGTCCGCGG TGGCGAAGCT CCGCCGGGCG GTCGAGCTCG ACGGCTCGAA CGTCGACCAC
CACCTGCACC TCGGCGCGGC GCTCGAGCGG GCGAACGCGC TCGACGAGGC GCTCGCGGCC
TACCGCGCCG CCGCCAAGGC CGACCCGCGC CGCGCCGACG CGCACGAGCG GCTCGCGCTC
CTCTTCGCCG CGAACGGCCG CTGCGACGCG GCGATCCCCG CGTACGAGAA GGCGGTCGCC
GCCGCCCCGC GGCTGGCGCG GCTGCAGATC GCGCTCGGCG ACTGCCAGCT CCGCGTGGGG
AAGGCGGAGG ACGCGGCGAA GGTGTTCCGC GCCGTGCTGC GCGCCGACGC GAAGGCCGTG
CCGGTGCTCT ACCGCCTGGG GCGCGCGCTG CACGAGTCGG AGGGCGAGCG CGCGGCGCTG
CCGTGGTACG AGCGCGCCGC CCGCGAGGAC AAGGGCAACC CGATGCCGCA CTACTACCTC
GGCTACCTCT ACAAGGAGCG GGGCGAGCGC CGCCGCGCGG TCGAGGCGTT CAAGGCCTTC
CTCGCCCTGA GGCCCGACGC GGACGAGCGG AAGGACATCG AGGGGGAGAT CGAGGATCTG
GGCGGGGCGC TGTAG
 
Protein sequence
MRVGCPHCHA AYNIDDRRIP ATGLNVRCPK CRETFPVRPA DPAGQGAPVP LPTGTGQASG 
AVPLAPPSSA SEGRPGAAPG VPLPPPAAPA SGGVPLPAPR AAPAVAPDLV TSAGIPLPPP
IPAGNASTDP FAAPPGADPF GMDVPDATAD AVEAAPEGEA LGFGEVDLGG GAPAARAAPS
LDDTDPFAPA PGATSWPPSS AAPAAQPFAA ASGADPFAKP EDPASAPRRE PPAAGEPLET
LYGEGAEAPP ADEVRYQVRL RSGKIVGPLG VPQVMGLRVR GELTGDEEVC REGEDDWSPM
SEVDALAGAA AAPARQGGRV AKLGRGAGRP RASGVAVAGA ALAIVLAVGV GAGFTPHGYF
FTAALRGKDG ARTAALVAQA RAALAKGDYP SERTALDLAA RAVAADPDAR DAAALHAMVV
AALELRHGAP PAALEQARRA ADRLAGGDAP DAPALAARLA LSVAGGGGAT APQEAALERS
GGAAARDPEV VALLARAALL RGDAAAAAAR FEALAALEPQ GARALHGKAL AALARGDAPA
AKAAFEAALA RDGGHLPSRL GLATLSQAAG DPAGTERHLA PLLAKDAEAK LAPGERARAL
ALRAELLARS AAGAAEAVRT WEAATAIDPR ATEIRVALAR HRLAHGDAGG AVAATEPVAA
SAAKDAALAA VRVRALAGAG RALDALSLAD VALASAPADA NLSLAKAAAL AGAGRIEDAA
ELYRAVAARS PEAWEPRLAL GRIALTRRQL DAAAVELEAA VERAPRVAAV HVGVGDLRLA
QGNAAGAEAA FRQALAVEPE SAAAETGLAR IALARGDAAA ARARLDRALA LDPRDAEGHV
AFGTLLWSAG DLAGAEKSLQ TAVELQPRNA TALMRLGAVK LERGDVDGAV QRLTAAAGEA
PQLAEAQQWL GRALLARGET PSAVAKLRRA VELDGSNVDH HLHLGAALER ANALDEALAA
YRAAAKADPR RADAHERLAL LFAANGRCDA AIPAYEKAVA AAPRLARLQI ALGDCQLRVG
KAEDAAKVFR AVLRADAKAV PVLYRLGRAL HESEGERAAL PWYERAARED KGNPMPHYYL
GYLYKERGER RRAVEAFKAF LALRPDADER KDIEGEIEDL GGAL