Gene Elen_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0043 
Symbol 
ID8414322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp57367 
End bp58422 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content67% 
IMG OID645023018 
Producttwitching motility protein 
Protein accessionYP_003180426 
Protein GI257789820 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTG AGCACATCAT AGACATGGCG CGCCAGGTAG GCGCTTCCGA CGTGCACCTG 
GTGTGCGGTC TGCCCGTGAA GTTCCGCCTG GCCGGATGCC TTGAGAACGC GGGCGTTGAC
GGCGACGCGC CGCTGTCGCA TGACGATTGC GAGCAGCTGG CCCGCCGTTT GGCCGGAGAC
GGCTTCGACC GCATCCAGCG CATCGGCGAG CTGGATCGCG CCGAGACCAT CGGGGGCGTG
CGCGTGCGCA TCAACCTGTT CCGCCAGCAA GGCCATGTGA GTGCGGCTTT GCGCCTGCTG
TCCGATCGCA TCCCCGCGCT GGAGACGTTG GGCTTGCCGT CTGCGGTCAT GGACTTCCCG
CGCATCCAGC GCGGCATCGT GGTGGTGACG GGCGAAACCG GCAGCGGCAA GTCCACCACG
CTGGCGGCGC TCATCGACAG CATCAACCAC ACGCGCGCCG AGAACATCAT CACCATGGAA
GACCCAATCG AGTACGTGTA CACGCCCGAC CAGTCCGTCA TCTCGCAGCG CGAGATCGGC
CAGGACACCG AAAGCTACAG CAACGCCCTG CGCGCCGTGT TGCGCGAGGA CCCCGACATC
ATCCTCGTCG GCGAGATGCG CGACCTCGAC ACCATCCAAA CGGCGCTGAC CGCCGCCGAG
ACGGGCCACT TCGTGCTGGC CACGCTGCAC ACGAAAAGCG CTGCCGACTC CATCGACCGC
ATGGTGGACG TGTTCCCCGA GGGCTTGCAG CGCCAGGTGC GCATGCAGCT GTCCACCACG
CTGGTGGCCG TGCTGTCGCA GCAGCTGCTG CCGCGCCGCG ACGGCATGGG CCGCGCGCTG
GCGTGCGAGC TGATGATGGT GACGCCTGCC ATCCGCAACC TCATCCGCGA GGGCAAGACG
CCGCAGATAG CCGGCTCGCT GGCCACGTCG GCCTCGGCGG GCAGCGTGAC CATGGACAAC
GCGCTGATCG CCCTGGCCCG CAACCGCGAC ATCACGTCCC GAACCGCCAT CGATGCCGCG
CACGACGTCG ACTACGTGAG AAAGAGCGTT CGCTGA
 
Protein sequence
MNVEHIIDMA RQVGASDVHL VCGLPVKFRL AGCLENAGVD GDAPLSHDDC EQLARRLAGD 
GFDRIQRIGE LDRAETIGGV RVRINLFRQQ GHVSAALRLL SDRIPALETL GLPSAVMDFP
RIQRGIVVVT GETGSGKSTT LAALIDSINH TRAENIITME DPIEYVYTPD QSVISQREIG
QDTESYSNAL RAVLREDPDI ILVGEMRDLD TIQTALTAAE TGHFVLATLH TKSAADSIDR
MVDVFPEGLQ RQVRMQLSTT LVAVLSQQLL PRRDGMGRAL ACELMMVTPA IRNLIREGKT
PQIAGSLATS ASAGSVTMDN ALIALARNRD ITSRTAIDAA HDVDYVRKSV R