Gene Emin_1104 details


Gene Information

Locus tag: Emin_1104
Symbol:
ID: 6263433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1205369
End bp: 1207399
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 41%
IMG OID: 642611584
Product: amino acid transporter
Protein accession: YP_001875993
Protein GI: 187251511
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
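
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1205369
end_bp = 1207399
gene_length_bp = 2031
protein_length_aa = 676

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # True
print(gene_length_bp % 3 == 0)        # True
print(residues == protein_length_aa)  # True
```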


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAG ATAATAAAAC TGCCGCCAAA AGCTCTTTAG GCGTTAAAAA GGTAATGGTG 
CTTACAACCG CCGTTCTTAC CTTTATACCT TTTTGGAAAG CGGCGGCCGT TGTTTTGTGC
GACTTTGGAT CCAGCGCTTT TTATGCGGGC GGGATAGCCA TGCGCGCGTT CGGCCCGGCT
TTTCCTTGGT ATATTTTATT TGTAATGCTG TTCTCGGGCA TACTTTTAAC GGTGTACATA
GAATCCTGCT CTCTTTTTGT ACGCGGCGGC ATATATAAGG TAGTAAAAGA AGGCCTTGGA
AACCGCGCGG CTAAGATAGC GGCTTCGGCC ATAGTGTTTG ACTTTATTTT AACAGGCCCT
ATAAGCGCGG TCACGGCGGG TCACTATCTT TCGGGCTTTA TAAATTCGAC AATGTCTTAC
TTCAGCGTCA ATATAAGAGT GCCGGATAAT ACATTTTCTA TAATATTCGC TTTGGCGGTT
ACGGTTTATT TTTGGCGCCA AAATATAAGG GGTATTTCCG AAAGTACGGA TAAAAGCGCT
AAAATAATTT TATTTTCCTT AATAGTCGCT TTTGTTTTAG CTGTTTGGGC TATTATTACC
ATATTCATAA AAGGCGCCTC CCTCCCCCCT TTAAAACCGC AGTTTACCGA ATACGCTTTG
GGCTGGTCGA CCAAATTTGA CTGGATGAAA GCCGTTGGTT TAATGGGCGT TATGATGGCC
ATAGGCCACA GCGTTTTGGC GTTAAGCGGC TTGGAAACGC TGTCCCAGGT TTACAGAGAA
ATTGAATTTC CTAAAATAGA AAACCTTAAA AAAGCGGCAA TAACCGTATT TATATTTGCT
CTTTTGTTTA CGGGCGGGTT AACTTTTTTA TCTTCCGTAA TAATACCCTC TGACTTAATC
GCCGCAAAAT ATCATGAAAA CCTGCTCTCC GGGCTTGCCA TGGAACTCAG CGGGCCTAAA
ATTTTAAGAC TTTTTATGCA AGCGGCGCTT GTTATAAGCG GCACAATAAT GTTATCGGGC
GCGGTTAATA CCGCGCTTAT AGGCTCAAAC GCCGTTCTTA ACAGAATAGC CGAGGACGGA
ATTCTAACCG ACTGGTTTAG AAAAATACAT AAAAAACACG GCACCACTTA CCATATGATA
AACCTTATAG TGGTAATACA AATGGCCGTA ATAATATTTT CAGGCGGTAA GGTGTATCTT
TTGGGCGAGG CGTACGCGTT TGGCGTTTTA TGGAGCGCGG TGTTGGAAAC CCTTTCTTTA
ATAATGCTGC GCTTTAAACA ACCGCAAACA AGAACCTTTA TGGTGCCTTT AAATTTTAAA
TTTAGAAATT ACACTATTCC TTTAGGCGCA ACGCTTATCT TTTTATTTTT ATTTTCACTT
GCTTCAATTA ACCTGCTTAC AAAAAGAACT GCCACAATAT CGGGCTTAAC TTTTACGGCA
ATACTGTACA CTGTGTTTTA TATATCGGAA AGGCTTAACG CAAAAAAAGC AAACATAATG
TTTGAGGAAG GACACAGAGA GGAAATTAAC ACTTCAACCG TTTCAACCTT AAATGAAGCT
TTGGAAGACC TGGAACATCA GGATCGTGTT GTGATAGGCG TTAAAAACCC CGATAACCTT
TATCATTTAG AAGAATTTTT AAAAACAGTG GAAGGCGATT CAACAGATAT CATTGTACTT
TACGCAAAGC CTTCTAAAGA CACTATTTTT GGAAAAGGCT CCCTTAAAGC CGCTCCGATG
GACGATAAAG AAATATTTTC AAACGTAATA CTTATAGCCG AAAAATACGG GCACGGAATT
ATACCTTTAA TGGTGGAATC TAATGACCCG TACTACGCCA TAAGCCAGGT GGCGCATACG
GCCGATGCGG ACAATATTAT TTTAGGAGTG TCAGGCTCGC ACGGCGCTAA TGACCAAATG
GAACGCATGG TTATGGCCTG GGGCGCGGTG CATGATAAAA AATTAGACCA CCCGGTAATT
GTAAAAATAT TGTGGGAAGG GCGCGAAGTA ACTTTTAAAT TTAACAGATA A
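
The record reports a GC content of 41% for this gene. A figure of this kind is commonly computed as the fraction of G and C bases over the sequence length. The sketch below takes that definition as an assumption, since the record does not say how its value was derived; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and newlines that group the sequence into blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence, used here only as a short sample input.
first_line = "ATGTCTACAG ATAATAAAAC TGCCGCCAAA AGCTCTTTAG GCGTTAAAAA GGTAATGGTG"
print(f"{gc_content(first_line):.1%}")
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported percentage.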
 
Protein sequence
MSTDNKTAAK SSLGVKKVMV LTTAVLTFIP FWKAAAVVLC DFGSSAFYAG GIAMRAFGPA 
FPWYILFVML FSGILLTVYI ESCSLFVRGG IYKVVKEGLG NRAAKIAASA IVFDFILTGP
ISAVTAGHYL SGFINSTMSY FSVNIRVPDN TFSIIFALAV TVYFWRQNIR GISESTDKSA
KIILFSLIVA FVLAVWAIIT IFIKGASLPP LKPQFTEYAL GWSTKFDWMK AVGLMGVMMA
IGHSVLALSG LETLSQVYRE IEFPKIENLK KAAITVFIFA LLFTGGLTFL SSVIIPSDLI
AAKYHENLLS GLAMELSGPK ILRLFMQAAL VISGTIMLSG AVNTALIGSN AVLNRIAEDG
ILTDWFRKIH KKHGTTYHMI NLIVVIQMAV IIFSGGKVYL LGEAYAFGVL WSAVLETLSL
IMLRFKQPQT RTFMVPLNFK FRNYTIPLGA TLIFLFLFSL ASINLLTKRT ATISGLTFTA
ILYTVFYISE RLNAKKANIM FEEGHREEIN TSTVSTLNEA LEDLEHQDRV VIGVKNPDNL
YHLEEFLKTV EGDSTDIIVL YAKPSKDTIF GKGSLKAAPM DDKEIFSNVI LIAEKYGHGI
IPLMVESNDP YYAISQVAHT ADADNIILGV SGSHGANDQM ERMVMAWGAV HDKKLDHPVI
VKILWEGREV TFKFNR
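
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which start codons are permitted). `translate` and the table constants are hypothetical names, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    # Drop the spaces and newlines that group the sequence into blocks of ten.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: matches the start of the protein sequence.
print(translate("ATGTCTACAG ATAATAAAAC TGCCGCCAAA"))  # MSTDNKTAAK

# Last line of the gene (in frame, since 1980 bases precede it): matches the protein's end.
print(translate("GTAAAAATAT TGTGGGAAGG GCGCGAAGTA ACTTTTAAAT TTAACAGATA A"))  # VKILWEGREVTFKFNR
```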