Gene Emin_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0834 
Symbol 
ID6262459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp917863 
End bp919161 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content43% 
IMG OID642611312 
Productamino acid/peptide transporter 
Protein accessionYP_001875726 
Protein GI187251244 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.655453 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.033682 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACTA ATAAAAATAA ACAGCCTCCT GCCCTGTTCA TGCTTTCAGG CGTGGAAATG 
TGGGAACGTT TTAATTATTA CGGGATGCGC GCGCTGCTGG TGCTTTTTAT GACCAGCCAA
ATAATAGGCC TTTCAGACAG GGCCGCGGGC CGAGTTTATG GACTTTTCGG CGCTTTGGTA
TACCTTACGC CGGTATTTGG CGGTTTAATA GCGGACGCCT ATTTAGGAAA AAGAAAATCT
ATTATCATAG GCGCCGTTTT AATGATGTGC GGGCAGTTTG TGCTTGCCTC TTATGGGTTT
TTACCTCCTA TAGCGGCTTT AGCAATAGGT TTAACCCTTA TTATAGCGGG CAATGGTTTT
TTTAAACCGA ATATTTCCTC TATAGTGGGG GAACTCTATG ATGAAAATGA CAATCGCCGC
GACGCCGGTT TTACCATATT TTATATGGGT ATTAACATAG GCGCGTTTTT AGCTCCGTTA
GTATGCGGTT ATTTAGGCGA AAAAGTAGCC TTCAGATACG GCTTTTTAGC CGCCGGTATA
GGCATGTTAA TCAGTTTAGT ATGGTTTATT TGGTTAAAGA ACAGGTTTTT GGGCGATATA
GGCATACGTC CCGCTATTGA GGAAAATAAA AACGATAAAG GGGAAAATGA GCCCCTTACC
AAAGTGGAAA AGGACAGGAT TTTAGCTATA TTCATATTTA CTTTCTTTTC AATCTTTTTT
TGGGCGTTTT ATGAGCAGGC GGGTTCTTCT TTAACTTTGT TCGCGGACAG ATCCACTGAC
AGAGTAATAT TCGGGTGGGA AATGCCTACA AGCTTTTTTC AGTCTTTCCC GGCATTACTT
GTGGTGTTAT TAGCGCCCGT GTTCGCCTGG TTATGGCGCA GAATGGGCGA AAAAGAACTT
TCCACCCCCG CTAAATTTGC CTGGGGTTTA GCTTTACTGG GCATAGGTTA TATTATCATA
GCCATAGCCG CATACGCATA CAAAAACAGC GGCCTGGTAA GCATATTTTG GCTTTGCGGT
TTGTATCTTA TGCATGTTTT GGGCGAATTA TGTATTTCTC CCGTAGGATT ATCTATGATT
ACCAAGCTTT CACCCGCTAA ATATGTTTCT TTATTTATGG GTGTTTGGTT CGCTTCGGAC
TTTTTCGGCG GACTTTTGGG CGGCTTCTTC GCGGGGGAAT ATAATGAGGC CAGTTTGGTT
TCTTTATTCT CTATACCCGC GGCTACGGCC CTTATATGCG CTTTAATTAT TTGGGCGCTT
TCGGGCAAGC TTAAAAAGTG GATGCACGGT ATAAACTAA
 
Protein sequence
MDTNKNKQPP ALFMLSGVEM WERFNYYGMR ALLVLFMTSQ IIGLSDRAAG RVYGLFGALV 
YLTPVFGGLI ADAYLGKRKS IIIGAVLMMC GQFVLASYGF LPPIAALAIG LTLIIAGNGF
FKPNISSIVG ELYDENDNRR DAGFTIFYMG INIGAFLAPL VCGYLGEKVA FRYGFLAAGI
GMLISLVWFI WLKNRFLGDI GIRPAIEENK NDKGENEPLT KVEKDRILAI FIFTFFSIFF
WAFYEQAGSS LTLFADRSTD RVIFGWEMPT SFFQSFPALL VVLLAPVFAW LWRRMGEKEL
STPAKFAWGL ALLGIGYIII AIAAYAYKNS GLVSIFWLCG LYLMHVLGEL CISPVGLSMI
TKLSPAKYVS LFMGVWFASD FFGGLLGGFF AGEYNEASLV SLFSIPAATA LICALIIWAL
SGKLKKWMHG IN