Gene Ent638_2107 details

Gene Information

Locus tag: Ent638_2107
Symbol:
ID: 5112228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2283695
End bp: 2285293
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 51%
IMG OID: 640492294
Product: extracellular solute-binding protein
Protein accession: YP_001176833
Protein GI: 146311759
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
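
The length fields above follow from the coordinates, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2283695
end_bp = 2285293

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# The CDS is read in triplets; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1599 bp
print(protein_length, "aa")  # 532 aa
```

Both values agree with the Gene Length and Protein Length fields reported in the record.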


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTACAG GGAAAACGCT GCTCGCATTA GCGCTGAGCA CACTGTTACC GGCAAGTGCT 
GCGTGGGCGG CAAAAAGCGA TACCATTATT TATTGCTCTG AAGCCTCACC CGAGTCGTTT
AACCCGCAAA TTGCCAGTTC AGGCCCGTCA TTCGTCGCCA GCTCACAGGT GCTTTATAAC
CGTCTGATCA ACTTTGATCC GGTGAAAAAT ACACCGGTTC CTTCCCTGGC AGAGTCCTGG
ACTATTTCAC CCGACGGCAA AACCTATACT TTTGCCCTGC GTAAAGGGGT GAAATTCAAT
AGCAACAAAT TCTTCAAACC GACGCGCGAT TTTAACGCCG ATGACGTTAT TTTCTCGGTG
ATGCGTCAGA AAGACCCTAA ACATCCGTAC CATAATGTGT CGAAGGGCAA CTACGAATAC
TTTAATGATG TCGGCCTGGA TAAACTGATT CAGGACGTTA AAAAGATCGA TGATTATCAC
GTTCAATTTG TATTAACCGA ACCGAATGCG GCGTTCCTGG CGGACTGGGG AATGGACTTT
GCTTCGATAC TGTCTGCGGA ATATGCGGAC GTGATGCTGA AAAAAGGCAC GCCGGACAAC
GTGGATACCT GGCCTATCGG TACAGGCCCT TATGTGCTGC AACAGTACAA AGTCGATTCA
CTGATCCGCT ATATCGCGAA CCCAAATTAC TGGGATGGCG AAGTGCCGAC CAAGCACCTG
ATTTTCTCCA TCACGCCAAA CGTCGAAACC CGCCTGGCGA AGCTGCAAAC CAACGAATGC
CAGATCATTC CTGCGCCATC ACCGGTGCAA TTTGATGTGA TTAAGAAGAA TAACGATCTG
GCGCTGCACT CTGTTGATGC GCTGAACGTC GGCTATCTGG CGTTTAATAC CGAGAAAAAA
CCGTTTGATA ACGTGCTGGT GCGTCAGGCG CTTAACTATG CCACGGACAA AAAAGCGATC
GTGAACGCCG TCTTTATGGG CTCCGGTACG GTGGCGAAGT CACCGATTCC GCCAAATATG
ATGGGCTATG ACAAAGAGTT GAAGGACTAC AGCTACGATC CTGAGAAGGC GAAAACGCTA
CTGAAGCAGG CTGGGCTGGA GCAGGGCGCG GAAGTGACGC TGTGGTCAAT GCCGGTTCAA
CGTCCTTACA ACCCAAATTC ACGCCGTATC GCGGAGATGA TCCAAAACGA CTGGGCGAAA
GTGGGCGTGA AGGCGAAGAT TGTCTCCTAT GAGTGGGGCG AATACTTGTC TGGTATGCGT
AAAGGCGAGC ATGATTCTGC GCTGTTTGGC TGGATGTCCG ATAACGGCGA TCCAGATAAC
TTTGCGGATG TGCTGCTGGG CTGTAACAGC ATCAAAACCG GTTCAAATGC CGCGCGCTGG
TGCGATAAGG GATACAATGA CCTGGTGCAG AAAGCGAAGT TGACCAGTGA CCCGGCCGCA
CGTGCCAAGC TCTACGGTCA GGCGCAGGAA ATTTTCTACC AGCAAGCGCC GTGGATTGCG
CTGGCGAACG GCAAAACGTT CTTCGCCACG CGCAGCAACG TGACGGGTTA TAGCGTCAGC
CTGATGGGCA GCGATTTCTC GAAAGCGAAG CTGAACTAA
 
Protein sequence
MSTGKTLLAL ALSTLLPASA AWAAKSDTII YCSEASPESF NPQIASSGPS FVASSQVLYN 
RLINFDPVKN TPVPSLAESW TISPDGKTYT FALRKGVKFN SNKFFKPTRD FNADDVIFSV
MRQKDPKHPY HNVSKGNYEY FNDVGLDKLI QDVKKIDDYH VQFVLTEPNA AFLADWGMDF
ASILSAEYAD VMLKKGTPDN VDTWPIGTGP YVLQQYKVDS LIRYIANPNY WDGEVPTKHL
IFSITPNVET RLAKLQTNEC QIIPAPSPVQ FDVIKKNNDL ALHSVDALNV GYLAFNTEKK
PFDNVLVRQA LNYATDKKAI VNAVFMGSGT VAKSPIPPNM MGYDKELKDY SYDPEKAKTL
LKQAGLEQGA EVTLWSMPVQ RPYNPNSRRI AEMIQNDWAK VGVKAKIVSY EWGEYLSGMR
KGEHDSALFG WMSDNGDPDN FADVLLGCNS IKTGSNAARW CDKGYNDLVQ KAKLTSDPAA
RAKLYGQAQE IFYQQAPWIA LANGKTFFAT RSNVTGYSVS LMGSDFSKAK LN
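
The relationship between the two sequences can be spot-checked by translating the gene sequence with translation table 11, as named in the record. The following is a minimal sketch, not a definitive tool: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is the amino-acid assignment of the bacterial genetic code (table 11), and special handling of alternative start codons is deliberately left out because this gene begins with ATG.

```python
from itertools import product

# Amino-acid assignments for translation table 11, listed in the
# conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence above.
first_line = "ATGTCTACAG GGAAAACGCT GCTCGCATTA GCGCTGAGCA CACTGTTACC GGCAAGTGCT"
last_line = "CTGATGGGCA GCGATTTCTC GAAAGCGAAG CTGAACTAA"

print(translate(first_line))  # MSTGKTLLALALSTLLPASA
print(translate(last_line))   # LMGSDFSKAKLN
```

The two printed fragments match the start and the end of the protein sequence listed above. Passing the complete gene sequence to `translate` should give the full protein, and `gc_percent` can be applied to it for comparison with the reported GC content.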