Gene Anae109_2481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2481 
Symbol 
ID5377430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2874757 
End bp2876592 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content72% 
IMG OID640844000 
Producttype II secretion system protein E 
Protein accessionYP_001379666 
Protein GI153005341 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.271892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.269219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCAGGA CCTATCACGC GACCGACTAC ACCCTCGAAT TCGTGGCGGA TCTCCTCGCG 
CGCCAGGGGA TCCTCACCGA CGACGCCAAG CGCACCGCCT TCGCCCGCGA GAACGTGCAG
CGCGCCCGGC TGCTGCGCGA CCACGCCTCG CGCGCGGGAG GCCGAGGGTT ACGCCGGGCG
GAGCTCTCGC CCATCGAGGT GCTCGCCTCG TTCGGCTTCA CCGACGCCCG CCAGGAGTCC
GAGGTCGTCG ACGAGGACAA GGCGACGCAG GCGGTGGCGC AGGCGGTAGG GGTCCCCTAC
CGCAAGATCG ATCCCCTCAA GCTCGACGCG CAGCTCATCA CGCGCACGCT GTCGCGTCCG
TTCGCGCGGC GTCACGGGGT GCTCCCGCTC GAGCGGCGCA ACGGGGCGCT GGTCGTCGCC
GCGGCGAACC CCTTCGACCG CGAGCTGTTC GAGAACCTGC GCGGCCTCAC CGGCGCGGAG
ATCGAGCCGG TCCTCTCGTC GCCCGCCGAC ATCCACCGCG CCATCGCCGA GGTCTACGGG
TTCCGCCAGC AGATCTCCGA GGCGCGCACC CAGCTCGAGC AGAGCGACGC CGCCCCGGAC
GTCGCGAACC TCGAGCAGTT CGTGAACCTC TCCGGGATCG AGGCGCTCGA GGCGTCCTCC
GAGCCGGTGA TCGCGGCCGT GGAGTACCTC CTCCACTACG CGTTCGAGCA GCGGGCGAGC
GACATCCACC TCGAGCCCCG GCGGGAGGAG TCGATCATCC GCATGCGGAT CGACGGCGTC
CTCCACCCGG TCCACCGCAT CCCCAAGGCC GTCCACGGCG CGATCGCCAA CCGGTTCAAG
ATCATGAGCC GGCTCGACAT CGCGATGAGG CGCCCGCAGG ACGGGCGCAT CCGCACCGCG
CGCGGGGACG CGGAGATGGA GCTGCGCGTC TCCACGATGC CGACCACCTT CGGCGACAAG
GTCGTGGTCC GCGTGCTCGA CCCGACCGTC CTCGTGCGCG ACCTCTCCGA GCTGGGCTTC
CTCCCGGACG AGCGCGACGC GCTCGAGCGC TGGCTGGTGC GGCCGCACGG GCTCGTGGTG
GTGACGGGGC CGACCGGCAG CGGCAAGACG ACCACCCTCT ACTCCGCGCT CCAGGCGCTC
GCCTCGCCCG AGGTGAACGT CGTGACCATC GAGGACCCCA TCGAGATGGT GCACGAGGAG
TTCAACCAGA TCGCGGCGAA CGCGAGGACG GGCACCGGCT TCGCGGAGGC CCTCCGCCAC
GTGCTGCGGC AGGACCCGGA CGTGATCATG GTCGGCGAGA TCCGCGACGG CGAGACGGCC
GCCCAGGCGG TCCAGGCGGC GCTGACCGGA CACATGGTGC TCACCACCCT GCACACGAAC
GACACGGTGA GCGCGGTCGC CCGGCTGCGC GACCTCGGCG TGCCGAGCTT CCTCGTCGCG
GCGACCCTGA CCGGCGTCGT CGCGCAGCGG CTCGTCCGCC AGGTGTGCCC GTCCTGCGCA
GCCGACGTGC CGCTGACGGC GGACGAGATC CACGCGCTCT CCGTGCCGCA CCCGGAGGAC
CACGCGGGCC AGCTCCTCGG GCGCCGGGGG CAGGGCTGCG CGAAGTGCCG GTTCACCGGC
TTCTACGGAC GCACCGGCAT CTTCGAGGTG CTGCCGGTGA ACGCCCGGCT CCGCCACCTC
GTCGCCGAGG GCGCCACGCC CGAGGTGCTC GCGCGCACGG CGCGGCAGGA CGGGCTCCGC
TCGCTCCGCG ATCACGCGGT GCGCAAGATC GCCTCCGGAG TGACCTCCTT CGAGGAGGCG
TTCCGCGCCA CCGCCGACGC GGAGGCCAGC GCGTGA
 
Protein sequence
MPRTYHATDY TLEFVADLLA RQGILTDDAK RTAFARENVQ RARLLRDHAS RAGGRGLRRA 
ELSPIEVLAS FGFTDARQES EVVDEDKATQ AVAQAVGVPY RKIDPLKLDA QLITRTLSRP
FARRHGVLPL ERRNGALVVA AANPFDRELF ENLRGLTGAE IEPVLSSPAD IHRAIAEVYG
FRQQISEART QLEQSDAAPD VANLEQFVNL SGIEALEASS EPVIAAVEYL LHYAFEQRAS
DIHLEPRREE SIIRMRIDGV LHPVHRIPKA VHGAIANRFK IMSRLDIAMR RPQDGRIRTA
RGDAEMELRV STMPTTFGDK VVVRVLDPTV LVRDLSELGF LPDERDALER WLVRPHGLVV
VTGPTGSGKT TTLYSALQAL ASPEVNVVTI EDPIEMVHEE FNQIAANART GTGFAEALRH
VLRQDPDVIM VGEIRDGETA AQAVQAALTG HMVLTTLHTN DTVSAVARLR DLGVPSFLVA
ATLTGVVAQR LVRQVCPSCA ADVPLTADEI HALSVPHPED HAGQLLGRRG QGCAKCRFTG
FYGRTGIFEV LPVNARLRHL VAEGATPEVL ARTARQDGLR SLRDHAVRKI ASGVTSFEEA
FRATADAEAS A