Gene Emin_1505 details

Gene Information

Locus tag: Emin_1505
Symbol:
ID: 6263977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1597831
End bp: 1599165
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 40%
IMG OID: 642611993
Product: amino acid/peptide transporter
Protein accession: YP_001876390
Protein GI: 187251908
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
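
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp values taken from the record above.
length = gene_length(1597831, 1599165)
print(length, protein_length(length))  # prints "1335 444"
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.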


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.000267924
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCACAAG ACGCAGTAGC GCAAAAACAA AAACAACCTA AAGCTTTATA TATGCTTTTT 
ATGGTTGAAA TGTGGGAAAG GTTCAATTAC TATGGTATGA GAGCTTTACT TGCTTTATTT
ATGGTTAGCA CTGTAATAGG ATTTACAAAA GCCACCTCAA GTAAAATATA CGGTATGTTT
ACCGCTTTAG TTTACTTAAC CCCTGTAATC GGCGGTTATT TGGCCGATAA GTTTATAGGT
AAAAGACACT CTATAACAAT AGGCGCCATT TTAATGGCCA TGGGCCAGTT CACATTAGCT
TCTTATGAGC TTATTCCTGC AAGATTAGCC TTGTGCATCG GTTTAGTCTT AATTATTATC
GGCAACGGTT TCTTTAAACC TAATATTTCC GCTATAGTCG GCGAATTGTA TGAAGAAAAC
GACCCCAGAA GAGACGGAGG TTTTACCATT TTCTACATGG GTATTAACCT TGGCGCTTTT
ATAGCTCCAT TTGTCTGCGG CACATTAGGC CAAAAAATCG CTTGGAAATA CGGTTTTATG
TCAGCCGGTA TCGGTATGCT TATAGGTCTT GTGTGGTATT TGGTTTCACA GAAAAAGTTT
TTAGGCGATA TAGGTTTATA CCCTGTAAGC AAAGTAACTA CCTCTAACAA AGAAGAACTT
AACAGACCTT TAACAAAAGA AGACAAAGAC AAAATTAAAG CGATTTCCGT ATTCGTGTTT
TTTGCCGTGT TTTTCTTTGC TTTTTTTGAA CAGGCGGGAA CTTCACTTAC CTTCTTCGCG
GAAGAAGCTA CAAGGCTTTA CGTAAACCTT CCTTTCTTTG GGCAAGTTAA ACTGGAATCT
TCTTATTTCC AGGCTATTAA CCCTATATTT GTTATCTTAT TAGCTCCTAT ATTTGCCAAA
CTTTGGCTTA ACTTAGGCGC TAAGAAAAAG GAACCCTCAA TCCCCAATAA GTTCGGCTGG
GGTCTTTTTC TCCAAGGTAT CGGTTTTGCG GTTATAGCGG TAGGCGCCAG CTTCTTCTTA
AAGGGCGGCC CTGTAAGTGC TATTTGGCTT ATCGGCGTGT ACTTTTTCTG CACCACCGGC
GAACTTTGCT TAAGCCCTGT AGGTTTATCA ATGGTTACAA AACTTGCTCC CGCTAAACTT
ATGTCACTCT TAATGGGTGT TTGGCTTATG TCAAGTTTCT TTGGAAACCT TCTTGCGGGC
TGGTTGGCAA GCTTCTATGA ATCATGGCAG CTTACAACAT TGTTCTCCGT ACCTGCGGTG
CTCTCCATAA TGTTTGGCGT AATTATGTGG TTAATGACAA ACAAAGTAAA ACGATGGATG
CATGGCGTAA ACTAA
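
The record reports a GC content of 40% for this gene. One way such a figure could be derived from a sequence block formatted like the one above is sketched here. This is an illustration only: the function name is hypothetical, and the database may compute or round its value differently.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among A/C/G/T bases.

    Whitespace (the 10-base grouping and line breaks) and any other
    characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


# Example on the first row of the gene sequence; pass the full block
# to obtain the gene-wide value.
first_row = "ATGTCACAAG ACGCAGTAGC GCAAAAACAA AAACAACCTA AAGCTTTATA TATGCTTTTT"
print(f"{gc_fraction(first_row):.0%}")
```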
 
Protein sequence
MSQDAVAQKQ KQPKALYMLF MVEMWERFNY YGMRALLALF MVSTVIGFTK ATSSKIYGMF 
TALVYLTPVI GGYLADKFIG KRHSITIGAI LMAMGQFTLA SYELIPARLA LCIGLVLIII
GNGFFKPNIS AIVGELYEEN DPRRDGGFTI FYMGINLGAF IAPFVCGTLG QKIAWKYGFM
SAGIGMLIGL VWYLVSQKKF LGDIGLYPVS KVTTSNKEEL NRPLTKEDKD KIKAISVFVF
FAVFFFAFFE QAGTSLTFFA EEATRLYVNL PFFGQVKLES SYFQAINPIF VILLAPIFAK
LWLNLGAKKK EPSIPNKFGW GLFLQGIGFA VIAVGASFFL KGGPVSAIWL IGVYFFCTTG
ELCLSPVGLS MVTKLAPAKL MSLLMGVWLM SSFFGNLLAG WLASFYESWQ LTTLFSVPAV
LSIMFGVIMW LMTNKVKRWM HGVN
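
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows, without external libraries. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as initiators; since this gene starts with ATG, a plain codon lookup is sufficient. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTCACAAG ACGCAGTAGC"))  # start of the gene: MSQDAV
print(translate("CATGGCGTAA ACTAA"))       # last row of the gene: HGVN
```

The two fragments reproduce the first six and last four residues of the protein sequence listed above.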