Gene Emin_1504 details


Gene Information

Locus tag: Emin_1504
Symbol:
ID: 6263793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1596355
End bp: 1597536
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 39%
IMG OID: 642611992
Product: hypothetical protein
Protein accession: YP_001876389
Protein GI: 187251907
COG category:
COG ID:
TIGRFAM ID:
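
The reported lengths follow from the coordinates. The short Python sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates as reported in the record above.
START_BP = 1596355
END_BP = 1597536


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1182 393
```

Both values agree with the Gene Length and Protein Length fields of the record.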


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.000468012
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGAAAA TTTTAAGTAT ATTTTTTCTT ACTTTGTGCG TGTCCTTATT CGCGCAAGAC 
AAGAAAGATA TTGAAAAAGA CATTATCAAA TATGGCGGCG CGGGCATACG CGCGGTCTAT
AACCTGGACT TTCCTACCGC ACAAAAAAAT ATAGATATCG CTTTTCAAAA ATACCCCAAC
CACCCGTACG CTCATTTTGG CAACATGCTT ATAGCCTGGG GAAGGTTTAC TTATGAATTT
GAAAAAAGCG ACCCGGAACA GAAAAAGATT TTTGAGTCAG TCCTTTCCTC TTCCATAAAC
GGGATTAACA TTTGGCTTAA GGATAACCCA AAAGATCCAA CCGCCTTTAT GGCCCTGGGC
GGAGCATACG GGTTAAAAGG CATGTTCGCT ATGGACAACA AAAACTGGGT TACGGCATAT
TTTTCAGCCA AAAAGGGAAT AAGTTATATG CGAAAAGCGC TTGAAGCGGA CCCGGAATTT
TATGACGCCT ACTTCGGCCT TGGCATATAT GAATATTACA CGGGCACGCT GCCTTCCGTA
ATTAAAGTTT TGGCTAAAAT AGTGGCTATA AAAGGGAATC AGACAAAAGG CATTGAATAT
TTAAATATTT CCAAAGAAAA AGGACAGTTC ACCTCTGACT CTTCAAAACT TATGTTAGTT
GAAATATATA ATAACAGGCT TTCACAGTTT TATAACCCGC AAGAATCACT TATGTATATA
AGAACTGTTT CCAACAAATA TCCGGCAAAT CCGCTTATGC CTTTTGTTGA AATTATAACG
GAATTTGAAA ATAAAAACTA TGATATTGTC ATTAAAAAAG CCAAGACTTT TATAAATAAA
ATAGGCGCTG CTCCTTTCTA TACAACTATG TATATTCCCC GTTCGTATAC CGCTATAGGC
ACGGCGCAAA TGGCTAAGGG AGAATGGGAA CAGGCATTAA AAACATTTGA AAACGCCAAA
GCTATTTCTT TTAACAAAAG TGAACCTACC AGATGGGCGG TTTGGAATTT AATAAGATTA
GGACAGTGCT ATGACGCATT AGGGCAAAGG GAAAAAGCCT TATCCACATA TAGAATGGTA
ACAGCTATGC CAAACACCTG GGAGCTTAAT GACGAGGCTA AAAAATTTAT AAAAACCCCG
TTTACCAAAG AAACCGCGCT CGGCCCGCTG CCCCCTCTTT AA
 
Protein sequence
MKKILSIFFL TLCVSLFAQD KKDIEKDIIK YGGAGIRAVY NLDFPTAQKN IDIAFQKYPN 
HPYAHFGNML IAWGRFTYEF EKSDPEQKKI FESVLSSSIN GINIWLKDNP KDPTAFMALG
GAYGLKGMFA MDNKNWVTAY FSAKKGISYM RKALEADPEF YDAYFGLGIY EYYTGTLPSV
IKVLAKIVAI KGNQTKGIEY LNISKEKGQF TSDSSKLMLV EIYNNRLSQF YNPQESLMYI
RTVSNKYPAN PLMPFVEIIT EFENKNYDIV IKKAKTFINK IGAAPFYTTM YIPRSYTAIG
TAQMAKGEWE QALKTFENAK AISFNKSEPT RWAVWNLIRL GQCYDALGQR EKALSTYRMV
TAMPNTWELN DEAKKFIKTP FTKETALGPL PPL
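
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration rather than a reference implementation. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in their permitted start codons), and it translates just the first and last lines of the gene sequence above. The helper name `translate` is hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAGAAAA TTTTAAGTAT ATTTTTTCTT ACTTTGTGCG TGTCCTTATT CGCGCAAGAC"
last_line = "TTTACCAAAG AAACCGCGCT CGGCCCGCTG CCCCCTCTTT AA"

print(translate(first_line))  # MKKILSIFFLTLCVSLFAQD
print(translate(last_line))   # FTKETALGPLPPL
```

The two outputs match the first 20 and last 13 residues of the protein sequence. Each full line of the gene sequence holds 60 bases, so every line starts on a codon boundary and can be translated on its own.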