Gene Emin_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0007 
Symbol 
ID6263502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp8828 
End bp10114 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content33% 
IMG OID642610469 
Producthypothetical protein 
Protein accessionYP_001874912 
Protein GI187250430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.325171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.000568724 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAAACA ATAAGAATAC AAAGCAATAT ATTTATATTG TTCAAGCCTT GTTAGAGCCC 
TCAAAGTGTA AAATAGGAAA AACAAAAGAT TTTGAAAGAA GATTGAAAGA ATATAACAAC
ATGACAGGAA AATCAAACGA AAATAGTTAT CGGTATCTGT TTGCCTGTGA AGTGAAAGAT
ATGTCGGAAT TAGAAAAAGA TATTAAAGAG AACTTCTCTG CCCTAAGAGA GCAAAAGAAG
AGAGAAATAT ATTTTTATAA TCCAAGCTTG TTTGATGGTT ATGTTAAATT TATAAAAACT
CATAAAATGT TTATTAAAGA GATTTTTACA AAAGAGGAAG ATAAAAAAAT AATAATAAAA
ATAGTAAAGA AAACCACGCC GTCACTTGAA GAAAGAGGGA TAACCCGAAA AGATGTAATG
CAAAAGGCGC AAAAGGTAGA TAATGACGAA TTTTATACCA GATTTGAAGA TGTCGAAAAA
GAACTTTCAC TGTATGATAT ATCGATTTGG AAAAACAAAA TAGTTTTTTG CAATTGCGAC
GACGCCGTGG ACAGCGATAA TAAAAAAACA TCTGCGTTTG CCTTGTATTT CATACAAAAA
TTTAAAGAAT TTGAATTGAA AAAACTGATT TGCACTCATT ATAGCGGTGT TGTTGATTTG
TTTAATCAAG GGGCAAGCGG GTATATATTT ACAAAGGACG GTTTTCGGGA ATTTAAGGAC
TATCCTAAAG GGTATACGGG TAGTTTTGAT GACCCTCTAT CTTTGAAAAT CCTTAATAAA
GAGGCGGATA TAGTATGCAC AAATCCCCCG TTTTCCCGAG CTAAAGATTT TTGGAGAATG
TTAATCGGAA GTTCTAAAAA ATTTATAATT ATATCTAACA TATCTAATCC CATAACTCCT
GCGTATATTA AATATTTTAG GGAGAATAAA GTATGGGCAG GCTATAACCG TGTGGATAAA
TACCTCAACC CCAAACGGGA ACTTGTAGAA GCATCCGGAC ACTGGTATAC TAATATTCCT
ATCCAAGACA GGCCAAAATC TAAAAATTTA AAAATAATCC CGCTAAAAGA CGTGCCCGAT
AAATATAAAA AGTTTGATGA CAATAAAACA TTATTAGTAG ATAATTGTTA TATTCCTGAT
GATTACAACA AGCCGTTTGC GGTGTCCGCC CGCCCGATAT TAAACGGCTT ACTGGAAAAA
GGATATAAAA TTGTTGAAGA TACCCAATAT TTTCCGTACG AGAACGGTAA AAAAGGGTTT
GGGCGGGTTT TGGTAGAGAA AATATGA
 
Protein sequence
MQNNKNTKQY IYIVQALLEP SKCKIGKTKD FERRLKEYNN MTGKSNENSY RYLFACEVKD 
MSELEKDIKE NFSALREQKK REIYFYNPSL FDGYVKFIKT HKMFIKEIFT KEEDKKIIIK
IVKKTTPSLE ERGITRKDVM QKAQKVDNDE FYTRFEDVEK ELSLYDISIW KNKIVFCNCD
DAVDSDNKKT SAFALYFIQK FKEFELKKLI CTHYSGVVDL FNQGASGYIF TKDGFREFKD
YPKGYTGSFD DPLSLKILNK EADIVCTNPP FSRAKDFWRM LIGSSKKFII ISNISNPITP
AYIKYFRENK VWAGYNRVDK YLNPKRELVE ASGHWYTNIP IQDRPKSKNL KIIPLKDVPD
KYKKFDDNKT LLVDNCYIPD DYNKPFAVSA RPILNGLLEK GYKIVEDTQY FPYENGKKGF
GRVLVEKI