Gene Emin_0968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0968 
Symbol 
ID6263799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1062821 
End bp1064443 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content45% 
IMG OID642611448 
Producthypothetical protein 
Protein accessionYP_001875858 
Protein GI187251376 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.000000116164 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAGAAAA ACACAGCATT AATTACACAG CTTAAAGAAA GCGGGCAGGA TTTTGAGTTT 
TACCCAACTA CCAAAGAGAT GGTAGCCACA ATATACTGCA ATGCCAAAGA GCGCAATATA
GACAGCGTTT TGGACATCGG CGCGGGCAAT GGCGGCTTTT TAAATGAGTT TGCGGCTCTC
TGCAAAAGCG AAAAAGAAGG AGGGCGCGAA ATAACAAAAT ATGCGATTGA AAAGTCCGAT
ATTTTAATGA GCCGCTACCC CGCCGATATT TTTATTCTAG GCACAGACTT TCACCAGCAG
ACTCTTATAG ACAAAAAAGT GGACATGATT TTTTGCAATC CACCGTATTC ACAATATGAA
CTCTGGGCGG CCCGGATTAT CAAGGAGGCT AATGCAAAGC ACGTTTATCT GATTATCCCA
CAGAGATGGA AAGATAACGA AGTAATCAAA GCCGCGCTTA AAAAGCGTAA GGCCCGCCAC
TATGTCATAG GCCACACAGA TTTTAAAGAC GCTGAACGCG CCGCCCGCGC CGTGGTTGAC
ATAATCAGAG TTGACCTCTG CGAAAGCTAC AGAGACCGTG CAGAAGTTGA CCCATTTAAA
ACGTGGTTTG AAGATTTTTT TAAGTTGCAA GCTGACAAAA CGGACGATAA CGCCTATGCT
CGCGCAGAGC GTAAAGCCGA TGATATTAAA AATGGACTTG TGAAAGGTCA AAACCTCATA
GATAATCTTT GCGAACTCTA CGGCGCAGAT ATGGCAAAAA TCCTTAAAAA TTACAGAGTG
CTTGAAACGC TGGACGCTGA TATATTCAAA GAGCTTAATG TCTCAGTAGA AGGACTTTGT
GAGAGCTTAA AGACAAAGTT AGAGGGCACA AAATCCCTTT ATTGGAAAGA ACTTTTTGAC
CACATGGACA CTATCACCGG CCGCCTTACT AGCGGCAGCC GAGCGGACCT GCTGAATACC
TTACAAGAAC ACACCAGCAT TGATTTTACT TCGTCCAATG CCTACATGGT TGTCCTCTGG
GCGATTAAGA ACGCCAACCG ATATCTGGAC AAGCAGCTTG TGGATATGTA TTTGGATTTG
AGCGATGCGG AGAACGTTTC AGCTTATAAG TCCAACAAAG TTTTTAAGAC GGACGATTGG
CGGTGGAATA GGCGGAATGA GAGAAAAGAA ACGCATTATA AGCTGGACTA TCGTATTGTC
AAAAATGTGT GGAAGTGTTT TGCACAGAGC AGTTATGAAG CATATTCATA CCCTAATGGC
CTCCACAATG ACTGCCATGC GCTTTTAAAT GACCTTTGCA CCATAGGCAA GAACCTCGGT
TTCAGCGTTC ATCAGAACAG CTTTAACTTT GAATGGACAC CCGGCGGACA GCGCACTTTT
GAATATGGAG ACGACAGCAA GCCCTTCATG GAGGTGCGCG CTTATAAAAA GGGCTCTATC
CATATTAAAT TTGCGCTTGA GTTTTCAAAG GCTTTAAACA TTGAAGCCGG GCGCATATTA
GGCTGGCTCC GCAACGCTGA GGACGCAAGC CAAGAGCTTG ATATACCCAC ACACGAGGTC
AGCTCGTATT TTAACAGAAA CCACGGGGCT AAGCTGGGCC ACGATATAAA ACTTTTAGCT
TAA
 
Protein sequence
MQKNTALITQ LKESGQDFEF YPTTKEMVAT IYCNAKERNI DSVLDIGAGN GGFLNEFAAL 
CKSEKEGGRE ITKYAIEKSD ILMSRYPADI FILGTDFHQQ TLIDKKVDMI FCNPPYSQYE
LWAARIIKEA NAKHVYLIIP QRWKDNEVIK AALKKRKARH YVIGHTDFKD AERAARAVVD
IIRVDLCESY RDRAEVDPFK TWFEDFFKLQ ADKTDDNAYA RAERKADDIK NGLVKGQNLI
DNLCELYGAD MAKILKNYRV LETLDADIFK ELNVSVEGLC ESLKTKLEGT KSLYWKELFD
HMDTITGRLT SGSRADLLNT LQEHTSIDFT SSNAYMVVLW AIKNANRYLD KQLVDMYLDL
SDAENVSAYK SNKVFKTDDW RWNRRNERKE THYKLDYRIV KNVWKCFAQS SYEAYSYPNG
LHNDCHALLN DLCTIGKNLG FSVHQNSFNF EWTPGGQRTF EYGDDSKPFM EVRAYKKGSI
HIKFALEFSK ALNIEAGRIL GWLRNAEDAS QELDIPTHEV SSYFNRNHGA KLGHDIKLLA