Gene Emin_1120 details

Gene Information

Locus tag: Emin_1120
Symbol:
ID: 6263958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1220546
End bp: 1222351
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 44%
IMG OID: 642611600
Product: hypothetical protein
Protein accession: YP_001876009
Protein GI: 187251527
COG category:
COG ID:
TIGRFAM ID:
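
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1220546, 1222351)
print(length_bp)                  # 1806
print(protein_length(length_bp))  # 601
```

Both results match the Gene Length and Protein Length fields.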


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATAA AATTTTTACT TCCCGCGGTT TTAATGTTTT TCTGCGTTTT CGCAAATGCG 
CAAATAAGGG TTGCCGCTTC CGTAAATGAA ACCGTTATGG AAGTGGGGGA AGAATATTAT
CTTACAATTA CCGTAACCGG CCCACCTAAC GACACTTTTG CCCCCGCTCT GCCCTCAATG
CCAAATTTTA ATGTTTATTA CGGCGGCCTT AACATAGCCA GCGCTGTTGA CGAAAGCGGC
TCTTATGTGC CGGAACTTCA CTATAATTAC CGCATAATAC CCAGGTTCCC GGGCAAGGCG
GAAATAGGCC CCGTTAAAAT TGAATATTTG GGTAAAGAAT ATACTTCCGA ACCTATTAAC
ATTAACGTTT ACAGAACGGG TGAGGCCGCC GGTAAAGAGC CGCCCAAACA AGCGGAAACA
AAAAAAGCAG CTTCTTCAAG ACCTGTTGTA CCCCGCGTTT CCGGGCAAGA AAAGAGCGCT
CCCGATATTC CCAAACAACA AAGTTATGAC GATGTTTTTT TGCTTGCCAA AACGGATAAA
AAAGAAGCCT ATGTGGGTGA GCAAATTACA TTAACCACTA CTTTTTACAG TTCCTATTCT
TTGGAAGGGT CCTCTCTTTA TTACGCGCCC GCTATTGAGG GGTTTACAAA AGAGGAAATG
GACGGCGACG TGGGCAAAAC AATGTTAGCC GGGCAGGAGT ACCTTTATAA TACCGTGGAA
ACAGCGCTTT TCGGCGTTTC CCCCGGGATA GGTACGGTAG GGCCTTCAAG GGTAGAATAT
ACCGCGTCCA AAAGCAGGGG AATGCCTAAA CTTGACAGCC TTTTGGGCAG GATTTTAGAG
CCTGGCTCCG TAAAAAGCGT GCCTTTGGAA ATAGGTATAA AACCGCTTCC CATGCAGGGC
AGGGATAAAA CTTTTACCGG GGCTGTGGGG GAAAAGTATT CAATAACCGC CTCCTTAGAC
CGTGATAATA TTGAAGTAGG CGAAGCCGCC ACTCTTACTT TAACCGTAAG GGGAGTGGGA
AATTTAAAAA TGGTGTCGCC TCCGCTAATT CCGGAAATAG AAGGCTTTAA ATCTTACGAA
GCGGCAGGTA ATTCAAACGT GGCACCGGTT AAAGGCGTTG TGCAGGGTAT GAAAACATTT
AAAACCGTTT TAGTAGCTAC CGCGCCCGGT GAATTTACTG TGCCTTCCAT CGCTTTTTCC
TTCTTCAGCC CTACAAATGA GAAATACGTA AAAGTAAATT CGGAGCCTTT AAAAATAAAG
GTTATTCCTT CAACCGGAGG GGTGGATAAT GTTATTTCTT ACGGCGGGGC GGCGCAGGGC
CCGTCCGCAA AAGCATATTT AACGGATATA AACTACATAA AGCAAAACGC TTGGGTAAAT
AAATTTAACC TGCTCCTTTT CTTTAACGAT TTGGGCCGTT TTAACCTTAT TCCCGTACTT
ATAGGGGCTA TAATTTTGTT TATTAAAATG CTTTCCAAAA GCGGACTTTT AGACAACCCT
GTAATAAAGG CTAAAGCGGC GGTAAAAGGC GCTAACGACG TTGAAACCCT GTCTTTGGCG
GTAACCCGGT TTGTTAAAGA CAAGACGGGG TTTTCCATGG GGAGCATGAC AACCAAAAAT
CTTGTGGCTG CTCTTTCTTC CAAATACAAT GTTTCCGCGC TTACCCTACA AACGCTTGAG
GATGTTTTAA ACCAGCTTAA CGCGTTTAGG TTCGCTCCCG CGGGCACGGT AAACGCTTTA
GAGTTTGATA ACGTTAAACA AAAAACTTTA CAAGTTTTAA AAAATTTGGA GCGTGAGATA
AAATGA
 
Protein sequence
MRIKFLLPAV LMFFCVFANA QIRVAASVNE TVMEVGEEYY LTITVTGPPN DTFAPALPSM 
PNFNVYYGGL NIASAVDESG SYVPELHYNY RIIPRFPGKA EIGPVKIEYL GKEYTSEPIN
INVYRTGEAA GKEPPKQAET KKAASSRPVV PRVSGQEKSA PDIPKQQSYD DVFLLAKTDK
KEAYVGEQIT LTTTFYSSYS LEGSSLYYAP AIEGFTKEEM DGDVGKTMLA GQEYLYNTVE
TALFGVSPGI GTVGPSRVEY TASKSRGMPK LDSLLGRILE PGSVKSVPLE IGIKPLPMQG
RDKTFTGAVG EKYSITASLD RDNIEVGEAA TLTLTVRGVG NLKMVSPPLI PEIEGFKSYE
AAGNSNVAPV KGVVQGMKTF KTVLVATAPG EFTVPSIAFS FFSPTNEKYV KVNSEPLKIK
VIPSTGGVDN VISYGGAAQG PSAKAYLTDI NYIKQNAWVN KFNLLLFFND LGRFNLIPVL
IGAIILFIKM LSKSGLLDNP VIKAKAAVKG ANDVETLSLA VTRFVKDKTG FSMGSMTTKN
LVAALSSKYN VSALTLQTLE DVLNQLNAFR FAPAGTVNAL EFDNVKQKTL QVLKNLEREI
K
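
The protein sequence is the translation of the gene sequence under translation table 11. As a rough check, the sketch below translates the first 30 and the last 6 nucleotides of the gene sequence. It assumes the compact codon ordering (bases in T, C, A, G order) and relies on table 11 having the same amino acid assignments as the standard code; the two differ only in which codons may act as start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the compact TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces used to group the bases
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence.
print(translate("ATGAGAATAA AATTTTTACT TCCCGCGGTT"))  # MRIKFLLPAV
# Last 6 nucleotides: one lysine codon followed by the TGA stop codon.
print(translate("AAATGA"))  # K
```

The outputs agree with the first ten residues (MRIKFLLPAV) and the final residue (K) of the protein sequence.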