Gene Athe_2453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2453 
Symbol 
ID7408077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2592967 
End bp2594157 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content38% 
IMG OID643716816 
Producttype II secretion system protein E 
Protein accessionYP_002574294 
Protein GI222530412 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTTT TAGAGAGAAT AAAAAGTCAT TCTTCAGAAG AATTTTCAAC ATATGACGAA 
CTTTTAACAT ATGTCAGAAA CAAAGTTCTG AGTAGCAGTC TGGATGTTGC AGGTAACCTT
GACCTTTTAA AAGAGTTAAC AAAACAATAC GTCGATAAGT ACTATGCAGA AAACTATTTA
GGACTTGCAC AGCAGCATAT TTACGATTAT GTTTACAATA TGTTATTTGG ACTCGGACCA
ATCGAAAAGC TTTTAAAGTC CCCAGATGTA ACTGAAATAT ACGTGATGGG GACAAAGATA
TACTATATAG AAAACGGACT CAGAAAAGAG TTAGAGGAAA AGTACCCAAA CGAGGCTGAA
ACACAGCGCG TTATCGAAAA GATAGCAGCA ACAGCAAGAC AAACAATAAA CATTCAAAAC
CCTGACATTG ACTGTGAGCT TTACGATGGT TCAAGAGCGT TGTTAGTCAT TCCCCCAGAA
AGCGTTCAAC CCTATATCAC TATCAGAAAA CACACATCAA AATTAAAAAC TCTTGAAGAG
CTAAGAAGCG GTTATATAAA CTTTGAAGAC TGGATGATAG ACTACTTTAA AAATGCTGTG
CGCAGCAGAA AAAATATAGT GGCAGTAGGT CAAACTAACG CTGGCAAAAC TACTTTTTTA
AACGCTCTGA CATATTACAT TCAAACAAAT CACGTTGTTG CAGTGCTGGA AGACACCCAC
GAGGTCGAAC TTCCTTTGAG GTACGTTTAT TATTTCAAAA CAAGAGAAGG AAACGAAGAA
CTAAGACCTA TAACGTGGAG CGACATAATA CTAAATTGCC TCAGAGCAAA CCCCGACAGA
ATATTTATAA CAGAAATACG AACCCCGGAA GCAGCGTATG GTTTTCTGGA CGCACTGAAC
TCTGGACACA GGGGAAGTCT CACAACCATA CATGCAGGGT CAACTTACCT TGCTCTGCAA
AAGCTTGAAA TGAAATTAAA AGAGTTCAAT CCCAACCTAG ATGTTCGTAA CATGAGGATT
TTAATTTCAA GTACAATAGA CGTATTAGTA TTTTTGGACA TTGCAGAAGA TGAAACTGGA
AACATTCTGG GAAGAGTTAT TAAAGAAATC GCAGAGCTAA AAGGGCTGAA TAGCGATGGG
ACTTACAAGC TTGATTATGT ATACAAGTAC GAACAACAGG AAAAGAGGTG A
 
Protein sequence
MGLLERIKSH SSEEFSTYDE LLTYVRNKVL SSSLDVAGNL DLLKELTKQY VDKYYAENYL 
GLAQQHIYDY VYNMLFGLGP IEKLLKSPDV TEIYVMGTKI YYIENGLRKE LEEKYPNEAE
TQRVIEKIAA TARQTINIQN PDIDCELYDG SRALLVIPPE SVQPYITIRK HTSKLKTLEE
LRSGYINFED WMIDYFKNAV RSRKNIVAVG QTNAGKTTFL NALTYYIQTN HVVAVLEDTH
EVELPLRYVY YFKTREGNEE LRPITWSDII LNCLRANPDR IFITEIRTPE AAYGFLDALN
SGHRGSLTTI HAGSTYLALQ KLEMKLKEFN PNLDVRNMRI LISSTIDVLV FLDIAEDETG
NILGRVIKEI AELKGLNSDG TYKLDYVYKY EQQEKR