Gene Anae109_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1840 
Symbol 
ID5377989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2095339 
End bp2097312 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content71% 
IMG OID640843348 
ProductTPR repeat-containing protein 
Protein accessionYP_001379027 
Protein GI153004702 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.829292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGGGC TCCATGGGCG CGTGATATCG TTCGCGGCCC AGCTCTCTCC CTCACCGGAT 
GCGACGCGAA TGAAACGGAC GGCCCCTGAA CCGAAGACGC CCGGAGCCGA GGGGCACTGG
CTCCCGCTGC TCGCCGTCAC GGTCCTCGGC CTCCTCTCCT ACTCGAACTC GTTCGGCGCC
GCGTTCGTCT TCGACGACGT CCCGAGCATC GTCGAGAACG TCCTCATCCG CGATCTGCGA
CATTACCTGC CAGGCGGCGG CACCTACGCG GCAGCGCCGA ATCGCTACCT CGGATACCTG
AGCTTCGCGC TGAACTACGC CGCAGGCGGC CTGGATCCGG CCGGGTACCA CGTCGTCAAC
CTGTGCGTGC ACCTCGCGAA CGCGTTGCTG GTCTACGCGC TCGCGATCCT CACCTTCCGG
ACGCCCGGCG TGCGGGACTC GAAGCTCGCC GGGTCGTCGC GCATCGTCGC GTTCGCCGCC
GCGGCGCTCT TCGTCGCGCA CCCGCTCCAG ACGAACGCGG TCACGTACGT GGTCCAGCGG
TTCGCGTCGC TCGCGACGAT GTTCTACCTG CTCGCCGTCG TGCTGTACGC GTGGTGGCGC
CTGCGGCAGG AGACCCATCC GCTCGCGCCC GCCTGGAGCG CCGCGAGCTA CGGGCTCGTG
CTCGCGTCGG TCGTCGCCGC CATGCGAACG AAGGAGATCG CCTTCACGCT TCCGCTCGCG
ATCGCCGGAT ACGAGCTCGT CTTCTTCGGG CCTCTCCGCA AGAAGACCGC CCTCGCGCTC
GCGCCGATCC TCGCGACGCT CGCGATCATC CCGCTCACGA TGCTGGGAGT GACCGCGGAC
GTCGGCGCTG CGGTCCAGCG CCTGGAAGGA GCCACGCGGA ACTCCGAGCT GTCCCGGGTC
GAGTACCTGA TGACGCAGCT GCCGGTCGTG GCGAGCTACC TCCGGCTCCT GATCGCTCCG
GTCGGGCTGA CGTTCGATCA CGACGTGCCG ATCGCGCGAT CGTTGCTCGA GCCACGCGTG
CTGCTCGGGC TCGTGGTCGT CGGGGGTCTC CTCGCGGTGG CCGCTCACCT GCTGTGGCGC
TCCCGCCCGG GCGCGGCGGG ACGTCTCGAT CCCAGCGGCC GGGTCGTGGG GTTCGGGATC
CTCTGGTTCT TCCTCGCGCT CTCCGTCGAG TCGAGCGTCA TCCCGATCAG GGACGTGATG
TTCGAGCACC GCGTCTACCT GCCGTCCGCG GGCCTCTTCA TCGGCGCCGT CACGGCCGCG
CTCTGGCTGG CGCGGCGGTA CTCCGTCCCG GACCGGGTCA CCGTCGCGAC GGCAGCGGTG
CTGGCGATCG CGCTCGGCGC GGCGACGTTC GCACGCAACC TCGTCTGGCA GGACGAGCTC
GCGCTGTGGG GCGACGCCGT CGCGAAGGCG CCGAACAAGA CGCGGCCGCA CACGAACTAC
GCCCACGCGC TCCGCAAGCT CGGGCGCGAC GAGGAGGCGC TCGAGCACTT CCGCGCGGTG
GTCCGCATCT CGCCGGACCA CGTGGACGGG ATGAACAACC TCGCGGTCCA CCTGGAGCGG
CTCGGACGCA CGGACGAGGC GGCGCAGTGG CTCGAGCTGG CGCTCCGCTC CAAGCCCGAT
CACGCGGAGA CCTACGTGAA CCTGGGGCGC CTGCTCCTGG TGGAAGGGCG CGCGCCGGAG
GCCGCCGCGC TCTCGCAGAG GGCGATCGAG CTGGACCCTG GTTACCGCGA TGCCTACGCG
AACCTCTCGG GTGCGCTGAA CTCTCTCGGC CGTCCCGACG AGACGCTCCG GCTGCTGGAA
TCGGCGCCGC AGGTCGTGCA GGCCAGCCCG GAGGCGCGCT TCAACCTCGG CGTCGCGCAC
GTGCTGCTGG GGAACGCCGC CGGCGCGAGC GCGCAGCTGG CGGCGCTCCA GCGCGGCGGC
AGCGATCGGG CAGGTCAGCT CGCCGCGTTC ATGGCGTCCC GCGGCATGCG GTGA
 
Protein sequence
MSGLHGRVIS FAAQLSPSPD ATRMKRTAPE PKTPGAEGHW LPLLAVTVLG LLSYSNSFGA 
AFVFDDVPSI VENVLIRDLR HYLPGGGTYA AAPNRYLGYL SFALNYAAGG LDPAGYHVVN
LCVHLANALL VYALAILTFR TPGVRDSKLA GSSRIVAFAA AALFVAHPLQ TNAVTYVVQR
FASLATMFYL LAVVLYAWWR LRQETHPLAP AWSAASYGLV LASVVAAMRT KEIAFTLPLA
IAGYELVFFG PLRKKTALAL APILATLAII PLTMLGVTAD VGAAVQRLEG ATRNSELSRV
EYLMTQLPVV ASYLRLLIAP VGLTFDHDVP IARSLLEPRV LLGLVVVGGL LAVAAHLLWR
SRPGAAGRLD PSGRVVGFGI LWFFLALSVE SSVIPIRDVM FEHRVYLPSA GLFIGAVTAA
LWLARRYSVP DRVTVATAAV LAIALGAATF ARNLVWQDEL ALWGDAVAKA PNKTRPHTNY
AHALRKLGRD EEALEHFRAV VRISPDHVDG MNNLAVHLER LGRTDEAAQW LELALRSKPD
HAETYVNLGR LLLVEGRAPE AAALSQRAIE LDPGYRDAYA NLSGALNSLG RPDETLRLLE
SAPQVVQASP EARFNLGVAH VLLGNAAGAS AQLAALQRGG SDRAGQLAAF MASRGMR