Gene Emin_1154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1154 
Symbol 
ID6263770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1253072 
End bp1254607 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content38% 
IMG OID642611634 
ProductTPR repeat-containing protein 
Protein accessionYP_001876043 
Protein GI187251561 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones104 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TTTTATGTTT TTTAATTTTA TCCGCGCCGT TTTTTCTACA AGCGGCCGAG 
CCTGACTACG GCAAAACGTT TTATATGGAC TATATAAAAG GCCTTTTGCA TGTTAGCGAT
AAAAAATATG ACAAGGCGCT TGATCATTTT GAGAAAAATT TAAAGGAATT TCCCGAGTCC
GAGTTTCTAA AAACGCTTAT TTTGCAAACT GCAATAGCTG CCGGGAAAGA AGATAACTAT
GAGGAAATAG CAAAAGAGGT TTCCCAATAT AAAGATAAAA ACTCTTTAAT TGCTTCCGCT
GCTTACAGCT GGTCAAAAGG ACATTTGAAA GACGCTCTTT CTTATTATGA AAGCGCGCTC
GCTTTGGATC CTGAAAATAC GGCTATTTTG GCCCAGTATC TTACGCTTTT AAACGGTATG
GACAGTGAAA GAGCCGTGGC CTTTTTGGAA GAATACGCCG AAAAAGTGCC CGAACTTGCG
GCGGTTATTT TTCAGGAAGC GGGCAATGTT AATTTGAAAA GGGGCCGTAC GGAAGACGCT
CTTACCATGT ATTTTAAAGC AACGAAAGCA AACCCCCGAT ATGCGGAAGC CTATATAAGC
CGGGCTGAAA TTTACCAAAA GCAGTCTAAA TTACAAGAAT CGCTTAAAGA ATATAAAAAA
CTTGAAGACA TGGGTTTAGC GGACACGTAT GTTTACCTTA GAATAGGAAC GCTGCATGTT
CTTTTAAAAA ATATTCCTGA AGCAAGAAAA TATTTTGAAA AAATTTTATC TTATGATCCT
TCCAGCATTT TGGCCAACCA GTTCATGGCT GCTATATCGG AGGATGAAAA AAATTATGCT
GCCGCTTTAA AATATTTGCA AGCGGCCGGA GATTACAAAA CCAATGCTTC AAAACTTTTG
CAGGCTTCTT TTTACGCGGC GAGAATGGGA AATGCGGAAG AAGCGTCATC CATTTTAGAT
AATGCTTACA AAGTATCCGA TAAAAGCGTT GAGGTAGGTT ATTTTTATGC GGTGTCTTTG
CAAGATTTAG GTAAGCATAA GGAAGCTGTT AAAATTTTTA AAGAGATTTT GTCCCAAACT
CCGCAATATG AAAAAGCGCG TATGATGTAC GGCGTTTCTT TAGACGCTTT GGGCGATAAC
GCGGAGCTTG AAAAACAAAT GAGAATAGTT GTAGGGCAAA ATCCCGCTAA TTCCGAGGCT
CTAAACTCTT TAGCCTACGC GTTGCTTGAG CAAAACAAAA AACTAAAGGA AGCTAAAAAA
CATATTGACA GATCACTACA GCTTAAGCCT GACGATTATG CAACCATTGA TTCACTGGGA
TGGTATTATT ATAAAACTAA AGATTATGAT AAAGCGCTTG AATATTTTGA AAAAGCTTTG
TCCAAAATGC CGGACGATAA AGTTATAGCT GGGCATAAGG GGCTTGCTCT GTACCGTTTG
GGAAGGTATA AAGAAGCTTT GCCGTGGATT ATAAAGGCTG AAGATAAAAA GCTAAATAAG
TATATAAAAA AAGCAGAAAA AAAATCGGGG GAATAA
 
Protein sequence
MKKFLCFLIL SAPFFLQAAE PDYGKTFYMD YIKGLLHVSD KKYDKALDHF EKNLKEFPES 
EFLKTLILQT AIAAGKEDNY EEIAKEVSQY KDKNSLIASA AYSWSKGHLK DALSYYESAL
ALDPENTAIL AQYLTLLNGM DSERAVAFLE EYAEKVPELA AVIFQEAGNV NLKRGRTEDA
LTMYFKATKA NPRYAEAYIS RAEIYQKQSK LQESLKEYKK LEDMGLADTY VYLRIGTLHV
LLKNIPEARK YFEKILSYDP SSILANQFMA AISEDEKNYA AALKYLQAAG DYKTNASKLL
QASFYAARMG NAEEASSILD NAYKVSDKSV EVGYFYAVSL QDLGKHKEAV KIFKEILSQT
PQYEKARMMY GVSLDALGDN AELEKQMRIV VGQNPANSEA LNSLAYALLE QNKKLKEAKK
HIDRSLQLKP DDYATIDSLG WYYYKTKDYD KALEYFEKAL SKMPDDKVIA GHKGLALYRL
GRYKEALPWI IKAEDKKLNK YIKKAEKKSG E