Gene Emin_1123 details

Gene Information

Locus tag: Emin_1123
Symbol:
ID: 6263473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1223995
End bp: 1225116
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 40%
IMG OID: 642611603
Product: von Willebrand factor type A
Protein accession: YP_001876012
Protein GI: 187251530
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
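
The listed coordinates and lengths can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is purely illustrative and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(1223995, 1225116))  # (1122, 373)
```

Under these assumptions the result agrees with the Gene Length (1122 bp) and Protein Length (373 aa) fields.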


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATAT TTTTAAAAAT ACTTATAGTA TTTATGGTTC TTTTAATCCT TAACCTGCTG 
GCTATGATTG TTGCCAAACA GGGTAAGGCA AAGGACTCTT TATATAAATT TTCTTTTTTA
CTTTTTGTAG TGACAGTGCT TTTAATGCTT TTAAATATTG TTATTTTTAA ATTTTTTAAA
GGTACTGTTT TCGCTAGTCC TTATTTGTTA TTTTTGATTT TGCCGATACT GCTGTTTTGG
CTGGCGTATC CTTTTTTGCA AAAACTATAT GCGCCGGGGT TAAACTATAA TTTGGCTTAT
AAGCCGGAAA CTTCAATACC CGCTTTAACG GCAAAATATT TTTGTTTTAC ATTAATAACG
CTTGGTCTTA TATTTGCCGT TTTGGCTCTT GCCAAACCGA GGGACGCGCA AAAAACAGTT
TTACCTCCTA CCGAAGGCGT GGATATTATT TTAGCTATAG ACACTTCAGG CAGTATGGCT
GCGCAGGATT TTGACCCTAA CAGAATAACG GCGGCCAAAG TAGCCGCGGC CAACTTTATA
GCCAACCGCT TAAGCGACCG TATAGGTATA GTTGTTTTCG CTTCGGACGC TATGTTGCAA
AGCCCGCTTA CTTTAGATTA TGAGTCGCTT TTGGACTTTT TGGCCGACGT TCGTATCGGC
ATGGTCAGGA CGGACGGTAC CGCTATAGGA GACGCTATTG CCGTTTCCTC TGTACATCTG
GAACGCAGTC CCGCAAGAAG CAAGGTGATA ATTCTTTTAA CGGACGGGGA GTCAAACAGC
GGCGTAATTT CCCCTCTGGA CGCGGCCAAA ACCGCCGCTT TATACGGCAT AAAAGTTTAT
ACCATTGCTA CCATAAGTAA AAACAGCCGT GACTCGCTTG ATTTTAAACC CGATGATTTG
GAACAAATAG CCAAACTTAC GGGCGGCAAA TATTACCGCG CGTATAATGA GGCGGAACTG
ACAAAAATTT ACGCGGAAAT CGACAGCCTT GAAAAAACGG AATTTAAAAA CAGCGTGCTT
GTTAATTACC GCGAAAGATA TCTGCCGTTT TTAGCTCTTT CACTTATTTT AATATCGTGC
GGATTTATAT TTTCCAAAAT TATTTTTATG AGGGTGCCCT AA
 
Protein sequence
MSIFLKILIV FMVLLILNLL AMIVAKQGKA KDSLYKFSFL LFVVTVLLML LNIVIFKFFK 
GTVFASPYLL FLILPILLFW LAYPFLQKLY APGLNYNLAY KPETSIPALT AKYFCFTLIT
LGLIFAVLAL AKPRDAQKTV LPPTEGVDII LAIDTSGSMA AQDFDPNRIT AAKVAAANFI
ANRLSDRIGI VVFASDAMLQ SPLTLDYESL LDFLADVRIG MVRTDGTAIG DAIAVSSVHL
ERSPARSKVI ILLTDGESNS GVISPLDAAK TAALYGIKVY TIATISKNSR DSLDFKPDDL
EQIAKLTGGK YYRAYNEAEL TKIYAEIDSL EKTEFKNSVL VNYRERYLPF LALSLILISC
GFIFSKIIFM RVP
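
The relationship between the gene and protein sequences above can be checked with a short translation routine. This is a minimal sketch, not the database's own pipeline: it relies on the fact that translation table 11 assigns codons to amino acids exactly as the standard code does (the tables differ in which codons may act as starts), and the helper names `clean`, `translate`, and `gc_content` are illustrative. Only the first and last lines of the gene sequence are used as input here.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA string; stop codons are shown as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGAGCATAT TTTTAAAAAT ACTTATAGTA TTTATGGTTC TTTTAATCCT TAACCTGCTG"
last_line = "GGATTTATAT TTTCCAAAAT TATTTTTATG AGGGTGCCCT AA"

print(translate(first_line))  # MSIFLKILIVFMVLLILNLL
print(translate(last_line))   # GFIFSKIIFMRVP*
```

Both fragments match the corresponding ends of the protein sequence, with the trailing `*` marking the TAA stop codon. Passing the full 1122 bp gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.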