Gene Emin_0644 details


Gene Information

Locus tag: Emin_0644
Symbol:
ID: 6263174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 711520
End bp: 712623
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 50%
IMG OID: 642611115
Product: hypothetical protein
Protein accession: YP_001875536
Protein GI: 187251054
COG category:
COG ID:
TIGRFAM ID:
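
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 711520
end_bp = 712623

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1104 367
```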


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000239983
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCTTA TAATTAAAAA AGGGTCTAAT TTTAAAAATA TGGACAAGAA AAAAGCCTAT 
ACTTGGGGGG CTTGTGCTTT GGTTTTTTTG CTTGTTCTTT TTACTTTAAT CGGCGCTATG
GCCGGTAGTG ATGAAGGCAA GCCCGACGAT TTTAGCAACC TTTCTTCCCG AAATTTCGAC
TTAGCGCAGC TTCCGTTTGT AAACGATGAG GCTGAAAAAG AATTGCTTGC GAAATATAAT
GATATCAGCG GTGTTCCGGA CAGCACTCTT TTTACCCCGG AAGAAAAGGA AGCCCGCCAG
GAGGCGGACG CTTTATCTGA AGAGGAAGCC CCCGACGCGG AGTATGAGGC CGCTTTAAAG
GAGTTATCCG CGCGTAATAC CCCTGCGCCC GCACCTGCTT CGTCTGCTTA TAGCAGTTAC
GGTTCCGGCG TAAGCAAACC TGCTACACAA ATAGGCACAA TGAGTAAAGG TTCAATGGTC
AGCGGAGGGG GAGGCGGTTT AAGGGGCACA AGCTGGACGC CCGGTAATGC TTCTGCGTCT
AATGCGAAAA CGGCTAACAC AAAGGTAAGC AAAGAAATGC TTGCCAAGCT CAGCAAAACG
GAAAGGGGCC GAAGTTTATT ACAGGCTTAC GCGGAATCTT CAGCAGGCGC TAAAAAAGAC
GGTGAGGGCG CCTTATCCGG GGCTATGGCT GCTTTTCAGG GCGGTAAAGC CACAGCGGAA
CTTGATACGG ATTTAGAAAC GGCTATGGCG GAGCTTGCGC TTGATGAAAC CGCCGGGATT
GGGGGCCAAG CTGCCAGTGA AGGGCCTTCA ATAGGTGATG TCGCCAAGGC TGTTAAAGAT
GGGCAGGAAA AGAAAGACAG GCAAATCCCC GAACCTAAAC CGAGTTTTTG GGCTGAGCTT
GGAAAACAAA TGCTAAAAGG TTTGGTTGAC GGCGCTACCC AAATTGCTAT TGGGCAAGCC
AACCAGCAAA TATCTATAAA AACATGTGTG AAGGGTTCTA AAGAGTACGG TTTTAACGCG
GCGGATTGTT TCGCTAAACC GGGAGGAGGA TCGTCCGGAA TTTCAAGCGG GGTCGGAGCT
TCAAGCGGGG CGGCAGGCTC ATAA
 
Protein sequence
MALIIKKGSN FKNMDKKKAY TWGACALVFL LVLFTLIGAM AGSDEGKPDD FSNLSSRNFD 
LAQLPFVNDE AEKELLAKYN DISGVPDSTL FTPEEKEARQ EADALSEEEA PDAEYEAALK
ELSARNTPAP APASSAYSSY GSGVSKPATQ IGTMSKGSMV SGGGGGLRGT SWTPGNASAS
NAKTANTKVS KEMLAKLSKT ERGRSLLQAY AESSAGAKKD GEGALSGAMA AFQGGKATAE
LDTDLETAMA ELALDETAGI GGQAASEGPS IGDVAKAVKD GQEKKDRQIP EPKPSFWAEL
GKQMLKGLVD GATQIAIGQA NQQISIKTCV KGSKEYGFNA ADCFAKPGGG SSGISSGVGA
SSGAAGS
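
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the gene sequence is shown on its coding strand and that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGCTTA TAATTAAAAA AGGGTCTAAT"))  # MALIIKKGSN
```

Passing the full gene sequence, with its spaces and line breaks left in, should yield the 367-residue protein, since the final codon TAA is a stop codon.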