Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1105 |
Symbol | |
ID | 5114054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 1212971 |
End bp | 1214539 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640491280 |
Product | extracellular solute-binding protein |
Protein accession | YP_001175837 |
Protein GI | 146310763 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAA AATTGCTGCC GTTACTGGTG CTGGCTGCGC TGTCTGCTGC CGCCCACGCC GCTACTCCGC CAAATACACT GGTTGTCGCA CAGGGTCTTG ATGACATCGT CAGCCTCGAT CCCGCCGAAG CCAATGAGCT TTCGAGTATT CAGACGGTGC CCAGCCTCTA CCAGCGTCTG GTTCAGCCAG ACCGCGATAA TCCGGAAAAA ATCACCCCAA TTCTGGCGGA AAGCTGGCAG GAAGATGCCG CTGCCAAAAC GCTCACCATC AAGCTGAAAC CCGACGCGAA ATTTGCCTCC GGCAACCCAT TGCGCCCGGA AGACGTGATT TTCTCGTATA CCCGCGCGGT CACGCTGAAC AAATCCCCGG CATTTATTCT GAACGTGCTG GGCTGGGAGC CAAACAATAT CGCCAGCCAG CTGAAAAAAG TGGATGACCA TACGCTCACG CTGCACTGGA CGGCGGACGT CAGCCCGGCG GTGGCGCTGA ATATTCTCTC CACGCCGATC GCCTCGATCG TGGATGAGAA GCAAGTTTCC GCGAATGTGA AAGACAACGA CTTCGGCAAC GCATGGCTGA AAATGCATTC TGCGGGTAGC GGCGCGTTCA AGATGAAGGT TTACCAGCCG CATCAGGCCA TCGTGCTGGA GGCCAACGAA ACGGCGCCTG GTGGCGCACC GAAGCTGAAA AGTATCATTA TTAAAAACGT GCCGGATCCG GCCTCTCGCC GTCTGCTGAT CCAGCAGGGC GATGCAGACG TGGCGCGCGA TCTGGGTGCT GACCAAATCA GCGCGTTGGA AGGCAAAGCG GGTGTGAAGG TGCTGAGCAT TCCGTCAGCC GAGCAAAACT ATCTGGTGTT TAACGCCGGA AATAGCGCCA ATCCGCTGCT GAATAATCCG GCATTCTGGG AGGCTTCGCG CTGGCTGGTA GATTATGAAG GCATCACCAA AAATCTGCTG AAGGGCCAAT ATTTTGTCCA TCAGAGCTTC CTGCCGGTCG GTCTGCCGGG CGCGCTGGAA GATAACCCGT TTAAATTCGA TCCGGCCAAA GCGAAAGAAA TCCTCGCCAA AGCGGGCATC AAAGACGCGC ACTTCACGCT GGATGTGGAG AACAAACCGC CGTTTATCAC TATCGCCCAG TCGATGCAGG CGAGCTTTGC TCAGGGCGGC GTAAAAGTGG ATCTGCTCCC GGCCGCGGGT AGCCAGGTGT ATGCCCGTGT GCGCGCGAAG CAGCATCAGG CCGCGATTCG CCTGTGGATC CCGGATTATT TCGATGCACA CTCTAACGCG AGCGCCTTTG CGTGGAACGA CGGGAAATCC AGCACCGTGG CAGGGCTGAA CGGCTGGAAA ATCCCCGAGC TGACCAAAGC CACGCTGGCA GCGGTCGCCG AACCGGATCC GGCGAAACGT CTGGATCTAT ACACTAAAAT GCAGCAGGAA TTGCAGCATA ACTCCCCGTA CGTGTTTGTC GATCAGGGCA AAACCCAAAT TGTGGTGCGC GACAACGTGA AGGGTTATCA GCAGGGGCTG AATGCGGATA TGGTTTGGTA CGATCGGGTG ACGAAATAA
|
Protein sequence | MTKKLLPLLV LAALSAAAHA ATPPNTLVVA QGLDDIVSLD PAEANELSSI QTVPSLYQRL VQPDRDNPEK ITPILAESWQ EDAAAKTLTI KLKPDAKFAS GNPLRPEDVI FSYTRAVTLN KSPAFILNVL GWEPNNIASQ LKKVDDHTLT LHWTADVSPA VALNILSTPI ASIVDEKQVS ANVKDNDFGN AWLKMHSAGS GAFKMKVYQP HQAIVLEANE TAPGGAPKLK SIIIKNVPDP ASRRLLIQQG DADVARDLGA DQISALEGKA GVKVLSIPSA EQNYLVFNAG NSANPLLNNP AFWEASRWLV DYEGITKNLL KGQYFVHQSF LPVGLPGALE DNPFKFDPAK AKEILAKAGI KDAHFTLDVE NKPPFITIAQ SMQASFAQGG VKVDLLPAAG SQVYARVRAK QHQAAIRLWI PDYFDAHSNA SAFAWNDGKS STVAGLNGWK IPELTKATLA AVAEPDPAKR LDLYTKMQQE LQHNSPYVFV DQGKTQIVVR DNVKGYQQGL NADMVWYDRV TK
|
| |