Gene Anae109_3966 details

Gene Information

Locus tag: Anae109_3966
Symbol:
ID: 5376009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 4626072
End bp: 4627421
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 70%
IMG OID: 640845490
Product: type II secretion system protein E
Protein accession: YP_001381128
Protein GI: 153006803
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
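
The coordinate and length fields above can be cross-checked against each other. A minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields above.
start_bp = 4626072
end_bp = 4627421

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1350 449
```

Under these assumptions the computed values agree with the reported Gene Length (1350 bp) and Protein Length (449 aa).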


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.406102
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCTCG GTGAGCGGCT GCGGCAGCGC GCGAGCGCGG CGAGCCCCGC GGCGGCGGCG 
CTGGTCCCGG AATCTGCGCC CGAGGTCTTC CACGACCTCA AGAGCGAGCT GCACCGCCGC
ATCATCGACA AGCTCGATCT GCAGGCCTTC GATCGCCTCG CGCCCGAGCG CCGCCGCGAC
GAGCTGCGGG CGGTGCTGTC GGGCGAGATC GGCCGCTCGG AGCTGCCGCT CAACCAGCTC
GAGCGCGAGC GCATGATCGG GGAGCTCCTG GACGAGCTCA CCGGGCTCGG GCCGCTGGAG
CCGCTCCTCG CCGACTCGAC CATCTCGGAC ATCCTCGTCA ACACCTACTC GACGGTCTAC
GTCGAGCGCC GCGGCAAGCT CGAGCTCACG GCGGTGCGGT TCGGCTCGAA CGGCCACCTC
CAGCAGATCA TCAACCGCAT CGTGGCGCAG GTGGGGCGCC GCGTGGACGA GACGTCGCCG
ATGGTGGACG CGCGCCTCGC CGACGGTTCC CGCGTGAACG CGATCATCCC TCCGCTCGCG
ATCGACGGCC CCATCCTCTC GATCCGCCGC TTCGGCGTGT CCCCCCTCAA GGTCCGGGAC
CTCGTGACGA ACGGCGCGCT CACCCCGGAG GCGGTCGGGT TCCTGGGCGC GTGCGTCAAG
GCGAAGCTGA ACGTCCTCAT CAGCGGCGGG ACCGGCGCCG GCAAAACCAC GCTCCTCAAC
GCGCTCTCTT CGTTCATCCC GGACACCGAG CGCATCGTCA CCATCGAGGA CTCGGCCGAG
CTCCAGCTCC AGCAGCGGCA CGTCGTTCGG CTCGAGACGC GGCCGGCGAA CATCGAGGGG
AAGGGCGAGA TCATCGCGCG CGATCTGGTG AAGAACGCCC TGCGCATGCG GCCGGACCGG
ATCGTCGTCG GCGAGGTGCG CGGCGGCGAG GTGCTCGACA TGCTGCAGGC GATGAACACG
GGCCACGAGG GCTCGATGAC CACCGTTCAC GCGAACACGC CGCGCGACGC GCTCTCCCGC
ATCGAGGCGA TGATCGGCAT GAGCGGCGTG CCCCTGAGCG AGGGGGCCAC GCGCGCGACG
ATCTCGCGCG CGCTCAACAT CATCGCGCAG CTGAACCGTG GCACCGATGG CCGCCGGCGC
ATCATGTCGA TCGCCGAGAT CACCGGCACC GAGGGCGCCG CGATCACCAT GCAGGAGATC
TACCGGTTCG AGCAGCGCGG GGTGGACTCC ACCGGCAAGG TCATCGGAGA GTTCATCCCG
ACCGGCATCC GCGCTCGCGC GATGACGCGC ATCGCGCAGT TCGGGGCCGA CCCTGCAGCG
ATCGCCGCGC GCGTGCTGGA GGACCGCTGA
 
Protein sequence
MGLGERLRQR ASAASPAAAA LVPESAPEVF HDLKSELHRR IIDKLDLQAF DRLAPERRRD 
ELRAVLSGEI GRSELPLNQL ERERMIGELL DELTGLGPLE PLLADSTISD ILVNTYSTVY
VERRGKLELT AVRFGSNGHL QQIINRIVAQ VGRRVDETSP MVDARLADGS RVNAIIPPLA
IDGPILSIRR FGVSPLKVRD LVTNGALTPE AVGFLGACVK AKLNVLISGG TGAGKTTLLN
ALSSFIPDTE RIVTIEDSAE LQLQQRHVVR LETRPANIEG KGEIIARDLV KNALRMRPDR
IVVGEVRGGE VLDMLQAMNT GHEGSMTTVH ANTPRDALSR IEAMIGMSGV PLSEGATRAT
ISRALNIIAQ LNRGTDGRRR IMSIAEITGT EGAAITMQEI YRFEQRGVDS TGKVIGEFIP
TGIRARAMTR IAQFGADPAA IAARVLEDR
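
The sequences above are printed in blocks of ten characters. As a rough sketch of how they might be processed, the Python below strips the spacing, computes a GC fraction, and translates DNA using the standard codon assignments (translation table 11 shares these with table 1; the two differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGGTCTCG GTGAGCGGCT GCGGCAGCGC GCGAGCGCGG CGAGCCCCGC GGCGGCGGCG"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating these first 60 bases gives MGLGERLRQRASAASPAAAA, which matches the first 20 residues of the protein sequence listed above.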