Gene Ent638_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_0184 
Symbol 
ID5110445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp213236 
End bp214843 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content52% 
IMG OID640490342 
Productextracellular solute-binding protein 
Protein accessionYP_001174925 
Protein GI146309851 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.507506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATTT CCTTGAAGAA GTCAGGGATG CTGAAGCTTG GTCTGAGCCT GGTTGCTCTG 
ACCGTCGCGG CAAGCGTAAA TGCGAAAACC CTGGTTTACT GTTCTGAAGG CTCGCCGGAA
GGCTTTAACC CACAGCTCTT CACCTCTGGT ACCACGTACG ACGCAAGCTC TGTGCCTATC
TATAACCGTC TGGTTGAATT CAAAATCGGT ACTACGGAAG TGATTCCGGG TCTGGCGGAG
AAATGGGAAG TCAGCGAAGA CGGCAAAACC TATACCTTCC ACCTGCGTCA GGGCGTGAAG
TGGCAGGACA GCAAAGAGTT TAAACCGACT CGCGATCTGA ATGCCGACGA CGTTGTGTTC
TCCTTTGATC GTCAGAAAAA TGCCCAGAAC CCGTACCATA AAGTCTCTGG CGGCAGCTAC
GAATACTTCG AAGGTATGGG CCTGCCAGAC CTGATTACCG AAGTGAAAAA AGTGGACGAT
AAGACCGTTC AGTTTGTTCT GAGCCGTCCG GAAGCGCCAT TCCTGGCTGA CCTGGCAATG
GACTTCGCGT CAATTCTGTC GAAAGAATAT GCGGATAACA TGCTGAAAGC GGGCACGCCG
GAAAAAGTGG ATCTGAACCC AATCGGTACC GGTCCATTCC AACTGCTGCA ATACCAGAAA
GACTCACGCA TTCTGTACAA AGCGTTCGAA GGTTACTGGG GTACTAAGCC GCAGATCGAC
CGTCTGGTCT TCTCCATCAC GCCTGACGCT TCCGTGCGTT ACGCAAAACT GCAGAAAAAC
GAGTGCCAGG TTATGCCGTA CCCGAACCCG GCTGATATCG CTCGTATGAA GCAGGACAAA
AACATCAATC TGCTGGAGCA GGCTGGCCTG AACGTGGGTT ACCTGTCCTT CAACACCGAG
AAGAAACCGT TTGATGACGT GAAAGTACGT CAGGCGCTGA CTTACGCGGT AAACAAAGAA
GCGATCATCA AGGCGGTTTA TCAGGGCGCA GGCGTTGCCG CGAAGAACCT GATTCCACCA
ACGATGTGGG GTTATAACGA CGACGTTAAG GATTACACTT ACGACGTTGA GAAAGCGAAA
GCACTGCTGA AAGAAGCCGG TCAAGAGAAA GGCTTTACCG TTGAGCTGTG GGCGATGCCT
GTACAGCGTC CATACAACCC GAACGCTCGC CGCATGGCTG AGATGGTTCA GGCTGACTGG
GCTAAAATCG GCGTTCAGGC CAAGATCGTG ACCTACGAGT GGGGCGAGTA TCTGAAGCGT
GCTAAAGCCG GTGAACACCA GGCGGTGATG ATGGGTTGGA CCGGGGACAA TGGGGATCCG
GATAACTTCT TCGCGACCCT GTTCAGCTGT GATGCAGCGA AACAAGGCTC CAACTACTCT
CGCTGGTGCT ACAAGCCGTT TGAAGACTTG ATTCAGCCGG CACGTGCGAC CGAAGACCAC
AACAAGCGTA TCGAACTGTA CAAACAGGCT CAGGTAGTGA TGCATGACCA GGCTCCGGCG
CTGATCGTGG CTCACTCCAC CGTGTACGAG CCAGTACGTA AAGAAGTGAA AGGCTACGTG
GTTGATCCAC TGGGCAAACA CCACTTCGAA AACGTATCGG TTGAATAA
 
Protein sequence
MSISLKKSGM LKLGLSLVAL TVAASVNAKT LVYCSEGSPE GFNPQLFTSG TTYDASSVPI 
YNRLVEFKIG TTEVIPGLAE KWEVSEDGKT YTFHLRQGVK WQDSKEFKPT RDLNADDVVF
SFDRQKNAQN PYHKVSGGSY EYFEGMGLPD LITEVKKVDD KTVQFVLSRP EAPFLADLAM
DFASILSKEY ADNMLKAGTP EKVDLNPIGT GPFQLLQYQK DSRILYKAFE GYWGTKPQID
RLVFSITPDA SVRYAKLQKN ECQVMPYPNP ADIARMKQDK NINLLEQAGL NVGYLSFNTE
KKPFDDVKVR QALTYAVNKE AIIKAVYQGA GVAAKNLIPP TMWGYNDDVK DYTYDVEKAK
ALLKEAGQEK GFTVELWAMP VQRPYNPNAR RMAEMVQADW AKIGVQAKIV TYEWGEYLKR
AKAGEHQAVM MGWTGDNGDP DNFFATLFSC DAAKQGSNYS RWCYKPFEDL IQPARATEDH
NKRIELYKQA QVVMHDQAPA LIVAHSTVYE PVRKEVKGYV VDPLGKHHFE NVSVE