Gene Emin_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1474 
Symbol 
ID6263974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1571464 
End bp1572939 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content36% 
IMG OID642611959 
Product5'-nucleotidase domain-containing protein 
Protein accessionYP_001876359 
Protein GI187251877 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA TATCTTTGCT TTTAGCTTTT GCTTTAACCG CGTGCTTTGT ATTTTCCAAA 
CAATTAATTA TTTACCATAC CAGCGATACC CATGGATTCT ATTATCCTGA AAGAAATACG
GAAAACAACA AAATGTGGGG CGGTTTTGCC GCGGCAAGAA ATGTTGTTAA TAAAGAAAAG
CTTCCTTTTT TATTTTTAGA CAGCGGTGAT TATTGCAATG GTACGGTTGA GGCAAAAAAC
TCAAAATGCG TAACTTCGGC AGAACTTATG AACGCCATGG GTTACGACGC TACCACAATA
GGCAACCATG AATTTGATTT TGGCGAGGAT AATTTTTTAA AAGTGCTTCC TTTGTTTAAA
TTCCCCGTGC TTAACTCAAC AATTACGGAC AGCAGGCTTA AAGGGCAGCT TCCTTACACA
AAACCTTATA AAATCTTTGA AAGAGCCGGC GTTAAAATAG CCATAATCGG CGTGGGTAAA
GAGGGTGATA ATAAACACTT TAAATTCGCA AATGTTATAA GCACTGTAAA AAAAGTTGTT
AAAGAGGTAA AAAAAGAAAA CGCCGATATT ATTATTTTGC TTATACATGA TTCCGCCGGC
GATGAAAAAC ACCCGCAAAA AGTAAGCAAT AAATTAATTG CGGAAAAAAT ACCTGAAATA
GATATTGTTT TAGGCGGCCA CGCCCACCAG GAATACCAAA ATATTTTTGT GGGCAACGCT
ATTTTGGTGG AATCGGGATG CCATTTAAAG AAGATGTCTA AAATCGTTGT TGATATTGAT
GATGAAACCA ATAAATATAA AACAGCCAAA TCTGAACTTA TACCTTTATA TATAGAAAAA
ACAGGGCAAG ACGAACAAAT TAAAGAACTT GCCGAAAGTT TGAGAGTTCC GGGTATGGAC
GTTGTTTTAG GCAACACGGC TGCGTATATA AGCAAAACGC CGGTAAAGGA AGGATGCAAA
GATTCCCCCA TTAACAATTG GATAGCCGAT GTTATAGCCA AAAACGTTGA AGGAGATTTT
ATTGTCCATA ACGTGGGCGG CGCCAGAATA GGGCTTGAAA AGGGGCCTGT TACCATGCGC
GATATTGTTA CTTTATTTCC TTTTGATAAT AAAATAGCCG TTGTTGAAGT TGACGGAAAA
TTTGTTAAAA ACTTTTTTAT AAACGGCATT AAAAACGGCC GCGCTTTATA TAACTTCCAC
GGGTTAACCG CAAAGTTTAA ATTAAAAAAT AATAAAGTTA AAAATGTTGA AATTTTTATA
AACGGCAATC CTTTGCAGGA AAACAAAACT TATAAACTTG TTACTAATGA ATATATCGCC
AAAGGTAAAA CCGAAGGCTG GATGTTTAAA AAAATTGAAG AGGATAAAAA ACAGTTTATT
TCGCTTAGCA TTAGAGATAT GCTTATAGCC GATTTAAAAG CGAACTCGCC TTTAAAACCT
TTAAATGACG AGTGCCGCCT CCAGGTTAAA AATTAA
 
Protein sequence
MKKISLLLAF ALTACFVFSK QLIIYHTSDT HGFYYPERNT ENNKMWGGFA AARNVVNKEK 
LPFLFLDSGD YCNGTVEAKN SKCVTSAELM NAMGYDATTI GNHEFDFGED NFLKVLPLFK
FPVLNSTITD SRLKGQLPYT KPYKIFERAG VKIAIIGVGK EGDNKHFKFA NVISTVKKVV
KEVKKENADI IILLIHDSAG DEKHPQKVSN KLIAEKIPEI DIVLGGHAHQ EYQNIFVGNA
ILVESGCHLK KMSKIVVDID DETNKYKTAK SELIPLYIEK TGQDEQIKEL AESLRVPGMD
VVLGNTAAYI SKTPVKEGCK DSPINNWIAD VIAKNVEGDF IVHNVGGARI GLEKGPVTMR
DIVTLFPFDN KIAVVEVDGK FVKNFFINGI KNGRALYNFH GLTAKFKLKN NKVKNVEIFI
NGNPLQENKT YKLVTNEYIA KGKTEGWMFK KIEEDKKQFI SLSIRDMLIA DLKANSPLKP
LNDECRLQVK N