Gene Emin_1473 details


Gene Information

Locus tag: Emin_1473
Symbol:
ID: 6263754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1569979
End bp: 1571463
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 40%
IMG OID: 642611958
Product: 5'-nucleotidase domain-containing protein
Protein accession: YP_001876358
Protein GI: 187251876
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases
TIGRFAM ID:
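
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the numbers fit that convention) and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 1569979
end_bp = 1571463
gene_length_bp = 1485
protein_length_aa = 494


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Here 1571463 - 1569979 + 1 = 1485 bp, and 1485 / 3 = 495 codons, of which the last is the stop codon, leaving 494 residues.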


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAA AAATAGTTTT TATATTAATA GCCGTTCTTT TTGCCATGGC CTGTAAACAG 
GAATATTTAA CAACTGTTAC CGTTTACCAT ACCGCGGGCA TAGGCGGCAT GTACTGGCCT
AGACAGGAGC CCGCTTATGA AAATAAGGAA ACGGGCGGCA TAGGCGTTTT AAAAAACTAT
TTTGAGCGCG AATACGGCAA AAAACTTTTG TTAGACAGCG GGGATTGGTT TTCCGAAACG
CCGGAAGGCT CTATCGGTAA AGGTTCTTAT ATTTTGAAAA TGATGCGCAT GGCGGGTTAC
GACGCTACCG GTTTAAGTTA TACGGACCTT GCGCCAGGCT GGGCGACAAT GTCCGAAAAT
ATCAGAAACA GCGAGTTTAC CGTTTTGTCC TCCAATATTA AAACAAGAGG GGGCGCGGTG
CCCTCTTATA TTCCCAGAAG CATTATAAAA GAAATAAACG GCGTAAAAAT CGGCATCTTT
TCTTTAGTCA TAAAAAACGA TAAGGAAAGC GCAGGCGGAA GACTTGGCGA AATTAAAATT
TTTGACGAAA TAGAAGCCGC CAAGGCTGTT ATTGATGATT TAAAGAATAA AGGCGCCGAC
GCGGTTATCC TTTTAGTGGA TGTTGCCCAA AATGAAAATA ACTATGAAGA AAGAGCAATT
TTGGAACAAG TTGAAGGCAT TAATATTGTT TTGGCCGGCG CGCCCTCGGG CGAAAAAACA
TCTATTGAAG AATATGACAA CGCTTACATA GTAAAAACTG AGCCTTTTTT AATTAAGCTT
ACAAAGCTAA AACTTAATTT TGATTTTAAT AAAAAATTGG CAGGAGTTGA GTTGGAAACG
ATACCTCTGT CTAAAGAAAA ATACGGGGAA GACGAAGCCA TAAAAAAAAT TGTTGATGAT
TTGCGTTCGG CTACTTTTAA ACGCTTAAAC CATGTTATAG CCGAAGCGGA AGATCTGATT
GCCGACGTGG ATACCGGGCC TTCGGTTCTT GGTGAAATTA TAGCCGGCTG TATTAAAGAC
TGGGCTAAAG CGGATATCGG CATAATTAAC TCTGATCCGC TGCGCGTGTC AATACCAAAA
GGGAAAATAA CGGAATACTC TTTATATGAA GTTTACCCCT ATAATGATAC GGTTATGTCC
GTAAGGATAA GGGGTGAGGA GCTTAAAAAC ATTTTGGAAA AAAGTTTACT TTCAAAAAAT
AATTTTCCTC AAATTTCAGG TATGACGGTT GAATATAATA TGAGCGAGCC GGAAGGCAGC
AAAGTTAAAT CCATAAAAAT AAGAGGGGGA AAGGTTTCCC CCGGCACAAT TTACCGTCTT
GCCACGACAG ACCATATTAT GGCGGGCGGC TTCGGTCATG ATGAATTTGT AAATGCCGTT
GAATTTAAAA ATACGCGCGT TGATATCCGC ACGCTGCTTC GCCAATGCTT ATACCGCAAG
AAGAAAATAT CTAATTTCCC GCTTAACTGG AAAGAAGTTA AATAA
 
Protein sequence
MSKKIVFILI AVLFAMACKQ EYLTTVTVYH TAGIGGMYWP RQEPAYENKE TGGIGVLKNY 
FEREYGKKLL LDSGDWFSET PEGSIGKGSY ILKMMRMAGY DATGLSYTDL APGWATMSEN
IRNSEFTVLS SNIKTRGGAV PSYIPRSIIK EINGVKIGIF SLVIKNDKES AGGRLGEIKI
FDEIEAAKAV IDDLKNKGAD AVILLVDVAQ NENNYEERAI LEQVEGINIV LAGAPSGEKT
SIEEYDNAYI VKTEPFLIKL TKLKLNFDFN KKLAGVELET IPLSKEKYGE DEAIKKIVDD
LRSATFKRLN HVIAEAEDLI ADVDTGPSVL GEIIAGCIKD WAKADIGIIN SDPLRVSIPK
GKITEYSLYE VYPYNDTVMS VRIRGEELKN ILEKSLLSKN NFPQISGMTV EYNMSEPEGS
KVKSIKIRGG KVSPGTIYRL ATTDHIMAGG FGHDEFVNAV EFKNTRVDIR TLLRQCLYRK
KKISNFPLNW KEVK
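
The relationship between the two sequences can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it removes the 10-character block spacing, translates codon by codon, and computes the GC fraction. The codon assignments are those of the standard genetic code, which translation table 11 shares for elongation (table 11 differs in the start codons it permits; this gene begins with ATG, so that difference is not exercised here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First display line of the gene sequence above (60 bases).
first_line = "ATGAGTAAAA AAATAGTTTT TATATTAATA GCCGTTCTTT TTGCCATGGC CTGTAAACAG"
print(translate(first_line))  # MSKKIVFILIAVLFAMACKQ
```

The printed residues match the first two blocks of the protein sequence (MSKKIVFILI AVLFAMACKQ). Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field; the first line alone is only a fragment and need not match the gene-wide figure.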