Gene Emin_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_1067 
Symbol 
ID6263783 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp1159930 
End bp1161270 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content45% 
IMG OID642611547 
Productphosphotransferase system EIIC 
Protein accessionYP_001875956 
Protein GI187251474 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones90 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTATA AACAATTAGC CGGACAAATT TTACAAATAG CCGGCGGAAA AGAAAATATA 
ACGGGCGTTT CGGTTTGTAT GACCCGTTTA AGAATGGGTG TAAAAAACAG AAAAATAATT
GACGTTGAAA AAATTAAAGC CATTGAGGAA GTTTACGGAA TTGTGGATAA CGGGCTGCAG
CTGCAGGTTA TTCTGGGGCC CGGTAAAGCG GGCAAAGTGG CTGATGAGTT TGCCAAATTG
GTTTCAATGC CTTTGGGGAC GTCTTCAATA GCATATACGC TAAGGGAAGA GTTAAAAGAA
AAAAACCAAA CCCCCGCTAA AGTCATGCTT AAAAAAATAG CTAATATTTT TATTCCGTTA
ATACCCGCTT TTGTGGGCTG CGGTCTTATA ATGGCCGTTA ACAATATTTT AATTAAATAC
GCGCCGGGCT GGAGCGTAAC CAATTTATCC CAGATATTAA GTATATTCGG CAACGCGGTT
GTTGTGGGAT TAAGCGTTTT TGTGGGCATT AATACCGCGC GCGAATTTGG CGGATCGCCA
ATGATAGGCG GCGTTATGGC CGTTATTTTA ACAAACCCTA TGCTTGCGGA CATTAAACTG
TTTGGCGAAA ACCTTGTGCC GGGCCGCGGC GGTGTTATAG CTGTTTTAAT GGTGGTTATT
TTCGCCAGCT GGCTTGAAGT TAAAATACGG AAGCTTATGC CTAACTCTTT AGATTTATTT
TTGACGCCCG TGCTTGTTAT TTTAATAGCG GGTACCGCCG CGCTTATTGC GCTGCAGCCT
TTAGGCGGTT TCCTGTCTTT AGGTATAGTG CAGTTTGTTA ACTTTGCCAT AGCCAAAGGC
GGCTTTGTGG CGGGGTATTT GCTTTCATTC GCGTTTTTGC CGCTTGTTAT GCTTGGTCTT
CATCAAGGGT TAACCCCGAT ACACGCGCAG CTAATTGAGG TGTATGGATA TACTGTTTTA
TTTCCTATAC TTGCCATGGC CGGCGCGGGC CAAGTGGGCG CGGCCATAGC TGTTTTAATT
AAAACAAAAA ATAAAAAACT TAAAAAAACA ATTTGGTCGG GCCTCCCCGT TGCCGTGCTT
GGCGTTGGCG AACCTTTGAT TTACGGCGTT ACACTGCCAT TGGGCAGGCC ATTTTTGGCC
GCTTGTATAG GTGGAGGTTT CGGCGGCGCT ATTGTGGCGA GTTTTCAGGT AGGCGCTTTT
GTTATAGGCG GTATATCGGG CATTCCTTTA GTTCCCCTTA CAACAATGCC TGCGGTGTAC
CTGGCGGGGT TGTTTGTTTC TTATATTTGC GGGTTTATCG CCGCCTGGTT TATCGGCTTT
GAGGACCCGG TGGTTTTTTA A
 
Protein sequence
MDYKQLAGQI LQIAGGKENI TGVSVCMTRL RMGVKNRKII DVEKIKAIEE VYGIVDNGLQ 
LQVILGPGKA GKVADEFAKL VSMPLGTSSI AYTLREELKE KNQTPAKVML KKIANIFIPL
IPAFVGCGLI MAVNNILIKY APGWSVTNLS QILSIFGNAV VVGLSVFVGI NTAREFGGSP
MIGGVMAVIL TNPMLADIKL FGENLVPGRG GVIAVLMVVI FASWLEVKIR KLMPNSLDLF
LTPVLVILIA GTAALIALQP LGGFLSLGIV QFVNFAIAKG GFVAGYLLSF AFLPLVMLGL
HQGLTPIHAQ LIEVYGYTVL FPILAMAGAG QVGAAIAVLI KTKNKKLKKT IWSGLPVAVL
GVGEPLIYGV TLPLGRPFLA ACIGGGFGGA IVASFQVGAF VIGGISGIPL VPLTTMPAVY
LAGLFVSYIC GFIAAWFIGF EDPVVF