Gene Acel_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2090 
Symbol 
ID4485679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2364974 
End bp2366695 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content65% 
IMG OID639730890 
Producttype II secretion system protein E 
Protein accessionYP_873848 
Protein GI117929297 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTCC AACCGACGGA AATCGATCCG CCGCAGCTGC CGCAGCGGCG CCGGCTCGGT 
GACGTCCTCG TCGAGCGGGA CCTGCTGACC CGCGAGCAGC TGGAGGAGGT TCTTGCGGCA
CAGCGACGAC TCACCGGCCG AGACCGCAAA CGCCTCGGTC AACTGCTCGT CGAGATGGGC
TACCTCACCG AGCGACAGGT CGCTCAGGCG TTGGCGGAGC TGCTCGCCCT CGAGCTCGTC
GACGGGAACG ATCTCGCGGT CCCGATGGAG GTCGCCCGGC TGCTCCCCCG GCAGGTCGCA
GAACGAGCCC GGGTTCTTAT ACTGGGCCGC ACGCCGGACG GTCTGAAGGT CGCAACCGCC
GATCCGACAA ATGTCGTCGC GTTGGACGAC GTTCGCGCCT ACACCGGCGC GCACTCGCTC
TCCGTTGTCG TCGCTCCGGA ATCCGTCATC AAAGAGCAAA TTGCCCGCGT GTACTCCATG
GCGGCGGAGG CTCAGCTTGA CGCCAGCAAG GATGAGGACA CCAAAGCGAA TCTCCTCGAG
GACGCAGAAC TTGCCCGCGC TGCGGACCAG GCGCCCACCG TCCGGCTCGT CAACCAGATA
CTCACCGATG CGATTCGGAT GGGCGCAAGT GACGTGCACA TCGAGCAGCA GAGCGACGGC
GTGTGGGTTC GCCATCGCAT TGACGGCGTC TTGCGGGACA TTACGCGGGT GCCGCGCGGA
GCTGCACCCG CGCTGATCAG CCGATTGAAA ATCGTCTCCG GGATGGACAT CGCCGAGCGG
CGCCTTCCGC AGGACGGGCG GATGAAAATC GACACCGACG GTGTCAGCAC GGAGGCGCGG
GTGAGCAGTC TGCCCGCAGT ACATGGGGAA AAAATTGTCA TCCGCCTGCT CGCCAGCGCG
GACCGCATCA CACGGGTCGA TGGATTGGGC ATGGAACCCG CCCAGCGGGA CATTCTCTTG
GCGGCGGCCC GGGCGGCGCA GGGGCTCATT CTCATCACGG GACCAACCGG CTCCGGCAAG
ACGAATACGT TGTATTCGGT TCTGGTGGAC ACCGCGACCC GGGAGAAGAA CGTCGTTACA
CTCGAAGATC CGGTGGAAAT CCGGCTTCCC GGCATTACCC AAGTGCAAAT CGACCAACGC
GCCGGTCTGA CGTTCGCCCG CGGACTCCGT GCGGTCCTGC GTCAAGACCC GGACGTCATT
CTCGTGGGTG AGGTACGCGA CAGCGAAACA GCGCACCTTG CCCTGGAAGC CGCCCTTACC
GGGCATCTCG TCTTGACCAC CCTGCACACG AACAGCGCCC CGGGAGCGGT GACCCGGCTC
GTCGAAATGG GTGTCGAGCC GTTCCTCGTT GCCTCCTCGC TCCGACTCGT CGTGGCTCAG
CGACTTCTTC GCCGGCCTTG CCCCGGCTGC GCCAAGCCCT ATCGCCCCGA CGACGACGTG
CTGCATCGCC TCGGCGTACG GACGGAATTG CCTTCGGACG CCGCGCCGGT GCGTGGCGTG
GGCTGTGTGG AATGCAACGG TACCGGCTAC CGTGGACGGA CCGGAGTGTT CGAAGTACTT
CCCATCAACG ACGAAACCCA TCGCATTGTC GTGCGGAATC CGACGGAAGC TGCCATCGCC
GAAGCGGCGG CCCACGTCGG CATGCGCCCG CTCCGTCAAG CGGCCATTAG CAAGGCCTTC
CGCGGCGAAA CGACCTTTGA AGAGGTCCTC CGCGTCTGCT GA
 
Protein sequence
MSVQPTEIDP PQLPQRRRLG DVLVERDLLT REQLEEVLAA QRRLTGRDRK RLGQLLVEMG 
YLTERQVAQA LAELLALELV DGNDLAVPME VARLLPRQVA ERARVLILGR TPDGLKVATA
DPTNVVALDD VRAYTGAHSL SVVVAPESVI KEQIARVYSM AAEAQLDASK DEDTKANLLE
DAELARAADQ APTVRLVNQI LTDAIRMGAS DVHIEQQSDG VWVRHRIDGV LRDITRVPRG
AAPALISRLK IVSGMDIAER RLPQDGRMKI DTDGVSTEAR VSSLPAVHGE KIVIRLLASA
DRITRVDGLG MEPAQRDILL AAARAAQGLI LITGPTGSGK TNTLYSVLVD TATREKNVVT
LEDPVEIRLP GITQVQIDQR AGLTFARGLR AVLRQDPDVI LVGEVRDSET AHLALEAALT
GHLVLTTLHT NSAPGAVTRL VEMGVEPFLV ASSLRLVVAQ RLLRRPCPGC AKPYRPDDDV
LHRLGVRTEL PSDAAPVRGV GCVECNGTGY RGRTGVFEVL PINDETHRIV VRNPTEAAIA
EAAAHVGMRP LRQAAISKAF RGETTFEEVL RVC