Gene Veis_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1038 
Symbol 
ID4690341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1153310 
End bp1154383 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content71% 
IMG OID639848817 
Productprotein of unknown function DUF513, hemX 
Protein accessionYP_995832 
Protein GI121608025 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.715062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCCG TATCGTCCGC CAACGATCCG CCCCCCGCCG CATCGGTCTT TGCCACCTGG 
CGCAGCAGTC TGGCCATCAC CGTGCTGGGC ACCATCGCCG CAGCCGCCGC GCTGAGCAGC
GGCATGCTGT GGCAAAAGCT CGGCGCCATC CAGGAACAAC TGGCCCGCCA GTCGGCGGAA
TCGGGCGCGC TGGCCATCGA GGCCCGCACC ATGGCCCGGC AGGCGCAAGA ACTGGTGCGC
GAGAGCGCTG CCAGGCTGTC GGTCGCCGAA ACCCGGATCA GCGAAGTGGC GCTGCAGCGC
AGCCAGCTCG AAGAACTGAT GCAGAGCCTG TCGCGCTCGC GCGATGAGAA CCTGGTGGTG
GACATCGAAT CGGCCGTGCG GCTGGCGCAA CAACAAGCGC AGCTCACCGG CAGTCTCGAA
CCCCTGGTGG CCGCGCTCAA AAGCGCCCAG CAGCGCATGG AACGCGCCGC CCAGCCGCGC
CTGGCCCCGG TGCAACGCGC GATGGACAAC GACCTCGATC GCCTGGGCCG CGCCAGCGTC
ACCGACACCG CCGGCCTGCT GGCCCGGCTC GACGATCTGG TGCGCCAGGT CGACGAGCTA
CCGGTGCAAA ACGCCGTGGC CCAGGTCGCG GCCGGCAGGC GGCAGTCCGG CCTGTCTGGC
GCCACCCCTG CCGCGCAGCC GGCCCACCAA GGCCGGCCGG CCTGGTGGCA CGCGGCGCTG
CAAAACGGCT GGGAAGTGGT GCGCGACGAG GTCCGGGGCC TGGCCCGGGT CAGCCGCATC
GACCAGCCCG AAGCCATTTT GCTGGCGCCC GAACAGGGCT TTTTTCTGCG CGAAAACCTC
AAGCTCAAGC TGCTCAATGC GCGCCTGGGC CTGCTGGCGC GCCAGTTCGA CGCGGCCCGG
GCCGACCTGA ACGCCGCCAC CGCCGCATTG AACAAGTATT TCGACCCCGC ATCGCGCCGC
ACACAATACG CGGCCTCGAT GCTGCAACAG GCCCAGGCCG GCATGAGGGC CGCCCCGTTG
CCGCGCCTGG ACGAAACCCT CTCTGCGCTG GCCACGGCCG CCGCAGGGCG CTGA
 
Protein sequence
MSPVSSANDP PPAASVFATW RSSLAITVLG TIAAAAALSS GMLWQKLGAI QEQLARQSAE 
SGALAIEART MARQAQELVR ESAARLSVAE TRISEVALQR SQLEELMQSL SRSRDENLVV
DIESAVRLAQ QQAQLTGSLE PLVAALKSAQ QRMERAAQPR LAPVQRAMDN DLDRLGRASV
TDTAGLLARL DDLVRQVDEL PVQNAVAQVA AGRRQSGLSG ATPAAQPAHQ GRPAWWHAAL
QNGWEVVRDE VRGLARVSRI DQPEAILLAP EQGFFLRENL KLKLLNARLG LLARQFDAAR
ADLNAATAAL NKYFDPASRR TQYAASMLQQ AQAGMRAAPL PRLDETLSAL ATAAAGR