Gene Emin_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1142 
Symbol 
ID6262603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1241186 
End bp1242181 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content38% 
IMG OID642611622 
ProductTPR repeat-containing protein 
Protein accessionYP_001876031 
Protein GI187251549 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TGTTTATTTT AGCTGTTTTA TGTTTAAGTG TTTTTACAAT AGGCTGCGGC 
AGCAATAACG CTAACTTTAA GAAAGCTTTG TATTATACCG ACATCGGCCG GTATAATGAT
GCTTTAAACC TTTACGGTAA AATAATAAAA TCCGATCCTA ATAACTATGC CGCGTATTCC
AACCGCGCCA TGGTGCATGA AAAAATTGCG GCCGCTATTT CTTTTAAAGA TTTAAAACTC
AGGCAGCAGC ATCTTGATTA CGCTGAAAAA GACTATTTAA AAGCGGTAAA ACTTAATCCT
AACGACGCTA AAATTTTAAA TAATTTAGGA GCTTTTTATA TTGACAGAGG CCAGTATTAT
AACGCTATTA TTTATCTTAA CGAAGCCTTG AGAGCAAGGC CCAATTATTA TAACGCGCTT
GTAAACAGGG GCATAGCGTT TTATAACGCG GGCGAAGGCA TTAAAGCGTA TAATGATTTC
CATAAGGCTA TAAACATAAA TAAGGACGGC TGGCTGGCTT ATTATAACAG AGGGTTGTTT
TATTATGACA TAGGTGACTA TCTTAACGCC GCTTTAGACC AGACCAGGGT TATAAATTTA
AAACCTTCTT ACGGTAAAGC GTATCTTGAA AGAGGGCGCG CTTTAAAATT AAATAATATG
TACGCCGACG CTCTTGATGA TTTTAAAATG GCTGTTGAGC TCGCGCCTAA CAACGCCGTT
GCGCGTTATT ATTTAGCTGA AATGTTTTTT AAAAACCACG ACCTGGGCGG CGCTTTAAGC
GAACTTTTGA TATCAAAACA ACTTGACCCG AGGTTTGTTC CCACCTACGA ACTTATGGGC
GATATTTTAG CTTTGGAAGA CAATGTTTCC GCCGCGGCTA ATTATATAAT AGCCAAAAAA
CTTGATCCCG CCAACGCCAG AAAATATGAC GTGAAAATAA GAAGGCTTCT TTCTGATCAG
GGCGTACGCA GAACCGTTGA AAGCAGATTC TATTAA
 
Protein sequence
MKKLFILAVL CLSVFTIGCG SNNANFKKAL YYTDIGRYND ALNLYGKIIK SDPNNYAAYS 
NRAMVHEKIA AAISFKDLKL RQQHLDYAEK DYLKAVKLNP NDAKILNNLG AFYIDRGQYY
NAIIYLNEAL RARPNYYNAL VNRGIAFYNA GEGIKAYNDF HKAININKDG WLAYYNRGLF
YYDIGDYLNA ALDQTRVINL KPSYGKAYLE RGRALKLNNM YADALDDFKM AVELAPNNAV
ARYYLAEMFF KNHDLGGALS ELLISKQLDP RFVPTYELMG DILALEDNVS AAANYIIAKK
LDPANARKYD VKIRRLLSDQ GVRRTVESRF Y