Gene Emin_0454 details


Gene Information

Locus tag: Emin_0454
Symbol:
ID: 6262593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 489725
End bp: 491296
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 43%
IMG OID: 642610925
Product: hypothetical protein
Protein accession: YP_001875348
Protein GI: 187250866
COG category: [I] Lipid transport and metabolism
COG ID: [COG2267] Lysophospholipase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00000000520171
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAAAA AAATCTTTAA GCTTGTTAAA TATACGGTTA TAATTTTTAT AGTATTGTTT 
TTTCTGTTGC TTGCAGGAGT TGCTTTACGT CTGCATTATG TGCCTGATTT GCAGCCGTGG
CATACATTTG CGCCTAATGA ACTGACCGTC AAGGAATTGG AAAAAGCCAC ATGGCAAGAT
TACACCGCCC GTGAAAACAA AATATTTGAA GAAGTTCATA AGAACGTAAT TTTAAAAACG
CCCGAAAGCG AGCAAAACCA AATCAACCGT TATTTCAAAG GCAGTAAAAT ATATCCCGGC
AATTTTAAGC AAGACTGGAA CCGTTCTTAT CTTTTAATAC CGGAAAATCC TAAAGGGGCG
GTTGTTTTAA TACACGGCCT TACGGATACT CCTTACAGCC TGCGTCATAT AGCGGAGATA
TATTATAAAA AAGGCTTTGT TGCGGTGGGG TTAAGGGTTC CCGGGCACGG TACGGTGCCT
GGCGCGCTTA CCAAAAGCGT TTGGCAGGAT TGGGCCGCTG CTACCAAATT CGCGGTAAAG
GAAGCCAAAA AGCTTACGCC GGAGGGGGCG CCTCTTCACA TAGCGGGTTT TTCACAAGGC
GGCGCGCTTG CCGTAAAATA CGCGTTGGAC GCGCTTGAGG ATGACTCTCT TATAAGGCCT
GACAGACTGG TTCTAATTTC TCCCATGATA GGTATTACAA GGCTTTCCAA ACTGGCCGAG
ATACTTGCCA TTCCATCAAT GCTTCCAGGG TTTGAAAAAG CCGCGTGGAT CAGTATTATT
CCCGAGTTTA ACCCTTTTAA ATACAACTCT TTTCCTGTTA ATGCGGTTAA ACAGGCCCGT
CTTTTAATTG CGGATGTGAA AAGGCAAGCC ATAAGGCTTG GGCAAAAAGA TATGTTAAAG
GAGCTTCCGC CTATAATTAC ATTTCAGTCA ATAGCGGATT ACACGGTCAG TACTCCCGCT
ATTATAAATG ACCTTTATAG CAATCTGCCG GAAAATGAAA GTGAGTTGGT TTTGTTTGAT
ATTAACCGCG ATACAGCGTT TTTACCGTTG GTGCGTCCTG TTTTTGTTAA TATGATGTCA
ATAATGTTAC CGGGATTTCC GCAAAAATAT AAGATAACTG TAATAGGCAA CTCCGGCCCC
AATGATTCAG GCGCGGTAGA GCGCAGCGTC GAGCCCGGCG GAGTGGATTT TAAAACAAGA
GCGTTGGGTC TGGTTTATCC TAAAGAGTTG TTTTCTCTTT CTCATATTTC GCTTCCTTTT
CCGGAAACTG ACCCGTTATA CGGCTCTATT CCTGACCCGG AAATAAAAGA TGCTTTTGGC
ATAAATCTGG GCCTTATATC AAACGCGCAG GGAGAACGCG GCGTTTTGGG AATAAACACC
AATTTATTTT TCAGAGTTTC ATCAAACCCT TTGTTTTCCT ATATTACCGC ACGCATAGAA
GACATTATTG ACACCTTTCC CCAGGAGACG GAAAAGACAT CCGCAGGGGC GTCTGCTTCT
AAATCTAAAA TTACGCAAAA ACAATACGGC GATATTATGA AAGCCGCTGA TTATAAAGAT
GAGCCTTTTT AA
 
Protein sequence
MNKKIFKLVK YTVIIFIVLF FLLLAGVALR LHYVPDLQPW HTFAPNELTV KELEKATWQD 
YTARENKIFE EVHKNVILKT PESEQNQINR YFKGSKIYPG NFKQDWNRSY LLIPENPKGA
VVLIHGLTDT PYSLRHIAEI YYKKGFVAVG LRVPGHGTVP GALTKSVWQD WAAATKFAVK
EAKKLTPEGA PLHIAGFSQG GALAVKYALD ALEDDSLIRP DRLVLISPMI GITRLSKLAE
ILAIPSMLPG FEKAAWISII PEFNPFKYNS FPVNAVKQAR LLIADVKRQA IRLGQKDMLK
ELPPIITFQS IADYTVSTPA IINDLYSNLP ENESELVLFD INRDTAFLPL VRPVFVNMMS
IMLPGFPQKY KITVIGNSGP NDSGAVERSV EPGGVDFKTR ALGLVYPKEL FSLSHISLPF
PETDPLYGSI PDPEIKDAFG INLGLISNAQ GERGVLGINT NLFFRVSSNP LFSYITARIE
DIIDTFPQET EKTSAGASAS KSKITQKQYG DIMKAADYKD EPF
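
Consistency check (illustrative sketch)

The length fields in this record can be cross-checked against its coordinates. The sketch below is a minimal illustration, not part of the database entry. It assumes that Start bp and End bp are 1-based and inclusive (which agrees with the listed gene length), and that the CDS is unspliced and ends in a single stop codon. The helper names clean_sequence, gene_length, protein_length, and gc_percent are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in space-separated blocks into one string."""
    # str.split() with no argument drops spaces, tabs, and line breaks alike.
    return "".join(text.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length from coordinates, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Values taken from the Gene Information fields above.
print(gene_length(489725, 491296))  # 1572
print(protein_length(1572))         # 523
```

To compare against the listed GC content, pass the full text under "Gene sequence" to gc_percent; the function strips the block spacing before counting.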