Gene Emin_1077 details

Gene Information

Locus tag: Emin_1077
Symbol:
ID: 6263495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1170925
End bp: 1173297
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 40%
IMG OID: 642611557
Product: hypothetical protein
Protein accession: YP_001875966
Protein GI: 187251484
COG category:
COG ID:
TIGRFAM ID:
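
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1170925
end_bp = 1173297
gene_length_bp = 2373
protein_length_aa = 790

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinates match Gene Length
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons match Protein Length
```

Under those assumptions all three checks print True for the values listed here.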


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 87
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAAAAAA TATCCGAAAA CTTAAAAAAC GCTTTAGAAC TTTCTAATCC CAAATATATA 
AAGAAGGCTT TTTTATTCCC CCGTAAGAAG GACCCCGGCG GCGGCTTTAA TCCGGTGCAA
GATATATCGG AAAAGATTGT TGAAATATCC TCAATTAAAT GGAAATTAGA CAATGAGGAT
TACGCCGTGT GGAATACTCC GAATACGGTT ATAACGCTTT CAAATAAAAA TAATGAGTTT
AGGGAAGGAG GTTCTTTTTT TGAGGAAGAT TCTATAATAC ACAAGAGTAA AATTATTATT
TACGCCGGCG CGGTATGCCG GGGCGGCGCG GAAACGGTTC CTGTTTTTGA AGGATACATA
TTAAATTCGC CCGTTTATTA CCCTGAGGAA AAAACCGTAA ACTTTACTCT TTCCGGACGT
CTTGCCGAGT TACAGGAAAT TGATGCCGGC GAAATAAAAT TAACCTCATT AAATGAGGAA
GCTGTTATTT TAAATGAAAC TTCCGTATGC ACACAATTAA CTTCCGTTTC CAAGATAAAC
GAAATAAGGC GCGGTTTTTG TTTTAATGAC AGCGCGAATC TTCTTTCCCC ATATGATTAC
GAGGTTTCTG ATTTAAATAT ATATAAAACG CCGGCTGTAA TTAGTTTAAA AAATCCGCTA
AAGGAAAACG AAAAAATTTG GCTTTCATAT TCCCATTGGT ATGAAGATAA AGAAATGAAC
TGGATAATAA AACAAATAGC TGACGCCGCC ATGCTAGATA AAAGAGATAT ACAGGAGGTT
TCCTTCGGGT CAAATGTTAT TAATAAATTT TCAGAAGGCG CGGGACTTCC TTTTAAAGGC
AAATACGAAG GTACTGAATA TAACAAGACA AGCGTTACAT TATCATCCGA ATTTCCTTTT
GATGTTGATT TTGAATGGGA AGTTATGGAA ACCACCTCAA GTGTTTCCTG GAATTTAACG
GTAAACGGGG TGATGATAAA CGGCTCTTTA AGCGACGCGT GTGTATCGGC AAGAAGCCGG
CAGGACAAAG CCTGCGGAAC GTGGCAATTT TCCGCATCTC CCGATTGGGA CGGTGAAAGA
TGTTTTTACC ATTTTATAAG CGATAACGGA TTAAGGCAAA GTTCCAGCGG TTACGCTTTG
TCTTTTGAAA GGCGTATAAA CGGGTTTTTA ATTTTAAGGA TATACCGGGT AAATAACGGC
GTTTTAGCCT TGTTGGGAAG TAAGGAACAT TATTACGCTT TTAGGGTAAG CGACGTCTTT
ATAAGGATTG TAAGGTATGA AGACGGCAGC TTTCGAATAT TTAGCCGCCC TACTAAAACG
ATAAATGTTT CATTTTGGAC TGATCACGGC ATTTTGTGTT CGGATAATAC TTATAAAGTA
TCTAATTATC AAATAGCGGT TTTTTATTCA CAGGCGGGAG GCAATAATAT TTCCAATATT
AAATACGCTT CTTTTACGCC GGACTGGTAT TGTGATTACA GCCCGCAAGG CAGTTATACA
TCGGGGGAAA TTGATTTGGG CGGCAATTTC CGCTCTTGGG ATAAATTTGA ACTTTCACAA
ACGGTATCTT CCGGGGTAGA AGCTGCCTGT GAGATACGAT TTAAAGAAAC CGAACACGGA
GAATGGAGCG GATGGATAGC TATTTCTGAC GGAGAAATTC CTTCCGGCCA GGCCCGTTAC
GCGCAATTAA GATGGCTGGC AAAGCTAACC GTTAACAACG CCTCTTTAAA ACCTTATTTG
CATTCCTGGT CGCTCGGATG GAGAAGCAGT AAAGCCAATA TAGGAATGGT AAATACAAGC
GGTATGAGCG CGCTTGATGT TATGAAAGAA CTTTCCAAAT TAAGCACCTT TGAAATAGGT
TTTGACCGGG AAAGTAAATT TTTATTCCGC GCAAGAAATG AAGATAAAAA TAATTATATT
GAAGTAACTT CTAAAGATAT TGTGCGGGTG GAAAATATTA ATTCCGGCGT AGATTACGTT
TATAACGTTA TAAGCGCTGA TTTTGGCGGC TATAAAGCGA CAGCCTCTCC GCAAACAATG
GGGGAAGGGT TTCCCGATTC AATTGATATT AACGGAAGGC GTGAGCTTAG TTTAGCTTCA
GCTTCTTTAC TGCCGCCGGA CAGCGTTGAC ATGGCGGCCA CTATATCAGC AATAGTTTAC
GATTATTTGA GTAAAAGAAA AAAACGTGCT GTTATTATAA TCAAATTCCT TCCGCAATTG
GATTTGGGCG ATATCCTTAA AATAACTTAC GCCGAGCCTC TTATAACAAA CAAGCAGGAT
AAATCATTAA ACGGTGTTTT TATGCGCATA GAAGGGGTTG AGTTTGACCT TGAAAACTGG
CAAATGCGTA TTGACGCCGT GGAGGTTTTA TGA
 
Protein sequence
MQKISENLKN ALELSNPKYI KKAFLFPRKK DPGGGFNPVQ DISEKIVEIS SIKWKLDNED 
YAVWNTPNTV ITLSNKNNEF REGGSFFEED SIIHKSKIII YAGAVCRGGA ETVPVFEGYI
LNSPVYYPEE KTVNFTLSGR LAELQEIDAG EIKLTSLNEE AVILNETSVC TQLTSVSKIN
EIRRGFCFND SANLLSPYDY EVSDLNIYKT PAVISLKNPL KENEKIWLSY SHWYEDKEMN
WIIKQIADAA MLDKRDIQEV SFGSNVINKF SEGAGLPFKG KYEGTEYNKT SVTLSSEFPF
DVDFEWEVME TTSSVSWNLT VNGVMINGSL SDACVSARSR QDKACGTWQF SASPDWDGER
CFYHFISDNG LRQSSSGYAL SFERRINGFL ILRIYRVNNG VLALLGSKEH YYAFRVSDVF
IRIVRYEDGS FRIFSRPTKT INVSFWTDHG ILCSDNTYKV SNYQIAVFYS QAGGNNISNI
KYASFTPDWY CDYSPQGSYT SGEIDLGGNF RSWDKFELSQ TVSSGVEAAC EIRFKETEHG
EWSGWIAISD GEIPSGQARY AQLRWLAKLT VNNASLKPYL HSWSLGWRSS KANIGMVNTS
GMSALDVMKE LSKLSTFEIG FDRESKFLFR ARNEDKNNYI EVTSKDIVRV ENINSGVDYV
YNVISADFGG YKATASPQTM GEGFPDSIDI NGRRELSLAS ASLLPPDSVD MAATISAIVY
DYLSKRKKRA VIIIKFLPQL DLGDILKITY AEPLITNKQD KSLNGVFMRI EGVEFDLENW
QMRIDAVEVL
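
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own tooling: the function names (`clean`, `translate`, `gc_fraction`) are hypothetical. It strips the 10-base grouping, builds the codon table from the usual NCBI ordering of bases (T, C, A, G), and stops at the first stop codon. The amino-acid assignments of translation table 11 are the same as the standard code. Alternative start codons, which table 11 allows, are not handled, and the gene here starts with ATG.

```python
# Codon table laid out in NCBI order: first, second, third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCAAAAAA TATCCGAAAA CTTAAAAAAC GCTTTAGAAC TTTCTAATCC CAAATATATA"
print(translate(first_line))  # MQKISENLKNALELSNPKYI
```

The printed peptide matches the first 20 residues of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.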