Gene Emin_0483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0483 
Symbol 
ID6262775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp514984 
End bp516753 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content48% 
IMG OID642610953 
Producthypothetical protein 
Protein accessionYP_001875376 
Protein GI187250894 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2165] Type II secretory pathway, pseudopilin PulG 
TIGRFAM ID[TIGR02532] prepilin-type N-terminal cleavage/methylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000147239 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC GAGGATTCAC CTTAATAGAA ATAGCTGTAG TAGTTTTAGT AATAGCTATT 
TTAGCTGCGG CAGCTTTGCC GCAGTACAGA AAGTCGTTAG AACGCTCAAG AGCTGCCGAA
GCTTTTGACA TCCTTACAGA GATAAGGAAT AAACAAGAAG CCAGGGATTT ACTTGGCACG
GGGACGGCCA AAGGCTATAC TGTAAAGTTT AGCGATTTGG GGGAAGTAAT AGCGGGTAAA
AATTCCACAA CAAACACATT AGACACAAGC CTGTTTACTT ATACGCTTTC AAATAACCCA
TATCCGCAGG CATATGCCAA AAGGAAGGAT ATGGATTATT CGATAGTTCA GACAAAGGGG
TATCAAGACA GTGCGTTATG TTGTTTGGGA AGGGATTGTA AAGTTGTAGA CAGCGTTTTA
AAAGGGTGTG AGAAGACAGC GTGCCTTACA ACATGCGCAA CGGGATATAA AAAAACGGGG
TACTTTTTTT CGGAGGACGG GCCTTGCTGT GAAGCAAAAA CGTCGTGTCC GACAACATGT
CCTACGGGAC AAAAGAGAAG TAGCGTGCAG TATACGGAAG ACGGAGCGTG CTGCGTAGCC
AAAACATCAT GCCCGACAAC ATGTCCTACA GGCCAGCAGA GGACAAGCGT ACAATATAGT
GAAGACGGAG CATGCTGCGT ATCAAAAACG TCGTGTCCCG CAACATGTCC TGCAGGGCAA
AAAAGAAGCA GCGTGCAGTA TACGGAAGAC GGAGCGTGCT GCGTAGCCAA AACATCATGC
CCGACAACAT GTCCCGCAGG TGAAGAAAGA ACTTCTTCCC AGTATTATGA GGACGGAGCT
TGTTGTAAAA CAGCTACTGT ATATACATGT TCGGGCGCAA GCAGCCAGGC TTGCGGCAAC
TGCGGAACGC AGACAAGAAC ATGTGATACT TCAACCGGCA CATGGAGCGC GTGGAGTTCA
TGCTCGGGAG AAGGCGTTTG CAGCCCTGGT GCGACGCAAA GTTGTACAGG CGGCACGCAA
ACATGTTCAA GCACTTGCGC GTGGGGCAGT TGTGAAGTAG TATCAAAAAG TTGTTCGGGC
GCGTCCACGC AAACTTGCGG TAAATGCGGC ACGCAGACAA GAACGTGTGA TACCACAACA
GGAGTATGGA GCGATTGGGG CAGTTGTTCG GGAGAAGGTG AATGTATACC CGGTGAAAAG
AGAGATTACG GTTGCGGCTC TAAATCAGGA ACAAACTGGG CTGTTTGCGG CTCAGACTGT
AAAATGGGTG AGCCTGAAAA CAAATGCTCA GCCTGTGAAG GAAAGAGCAC CCAGTCTTGT
ACTTGCAACG GAATCCAAAC ACGTACATGC AATGAATCAA CAGGAACATG GAGTGCGTGG
GGAGCATGTG AAGGAGGACA AAACCCTTCC AATACGGCTT CCACAACAAC AAAATGCTCG
TTTAATAAAA CTATCCGATA CTCTTATGGT GATATGTACA GTAGCAGTAC AGGGGTAGCT
TATACTATTA ATGGCACTAA AACCAAAAGC TGGAATAAAT ATACTTGTCA GTGGCAGGAA
AGTAAATGCA GCGGTATGAT TTGGGTTTCT TTAGGCCAGC CTGCGGCCGT AGGCCAGGGC
GCGCTTTGCC GAAGTACGCA GGCCTGTTCC GCTCCCGGAA CGGGGTGGAC TTACGTGGGC
GGTTCGTGCA ACGGAAGCAG TTTGTTAAGA TGTTCGTCAA GCCCTAGTGA TTCATGTTAT
TTGTATAATT GCCAAAATGT ATCCCAATAA
 
Protein sequence
MKKRGFTLIE IAVVVLVIAI LAAAALPQYR KSLERSRAAE AFDILTEIRN KQEARDLLGT 
GTAKGYTVKF SDLGEVIAGK NSTTNTLDTS LFTYTLSNNP YPQAYAKRKD MDYSIVQTKG
YQDSALCCLG RDCKVVDSVL KGCEKTACLT TCATGYKKTG YFFSEDGPCC EAKTSCPTTC
PTGQKRSSVQ YTEDGACCVA KTSCPTTCPT GQQRTSVQYS EDGACCVSKT SCPATCPAGQ
KRSSVQYTED GACCVAKTSC PTTCPAGEER TSSQYYEDGA CCKTATVYTC SGASSQACGN
CGTQTRTCDT STGTWSAWSS CSGEGVCSPG ATQSCTGGTQ TCSSTCAWGS CEVVSKSCSG
ASTQTCGKCG TQTRTCDTTT GVWSDWGSCS GEGECIPGEK RDYGCGSKSG TNWAVCGSDC
KMGEPENKCS ACEGKSTQSC TCNGIQTRTC NESTGTWSAW GACEGGQNPS NTASTTTKCS
FNKTIRYSYG DMYSSSTGVA YTINGTKTKS WNKYTCQWQE SKCSGMIWVS LGQPAAVGQG
ALCRSTQACS APGTGWTYVG GSCNGSSLLR CSSSPSDSCY LYNCQNVSQ