Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_0912 |
Symbol | |
ID | 5111101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 1014018 |
End bp | 1015718 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640491088 |
Product | extracellular solute-binding protein |
Protein accession | YP_001175647 |
Protein GI | 146310573 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCAGC TAAATCGTCT CAACCAATAT CAACGACTGT GGCAACCGTC TGCGGGTGCC ACTCAACACA TTACGGTAAA CGAACTTGCC AGCCGCTGCT TTTGCAGCGA ACGGCATGTA CGAACGCTGT TGCGACAGGC ACAGGATGCC GGTTGGCTAA GCTGGCGAGC GCAATCCGGG CGCGGCAAAC GAGGGGAGCT GACCTTCCAC GTTTCGCCAG ACTCTCTGCG CAATATCATG ATGGAAGAGG CGCTAAAAAG CGGGCATCAG CATAACGCGC TGGAACTGGC ACAAATTGCG CCACAGGAAC TCCGCACGCT GCTGCATCCG TTCTTGGGTG GTCAGTGGCA AAACGATACG CCCACGCTGC GTATTCCTTA CTATCGCTCG CTTGATCCGC TGCACGCAGG ATTTTTACCC GGGCGGGCTG AGCAGCATCT TGTGGGGCAA GTCTTCTCGG GGCTCACACG GTTCAACGGC AACAGTAGCG AACCCACGGG GGATCTGGCT CATCACTGGG AGGTGTCGGC CGACGGGTTA CGCTGGCATT TTTACATTCG CTCGACGCTG CACTGGCATA ACGGCGATAA GGTCGAGACG GCACAGCTTC AGTCCAGCTT GATAAAATTG CTCGATCTCC CTGCGCTGCG CCGATTGTTC AACAGCGTAT TGCGCATCGA TGTGACACAT CCGCAATGCC TGACGTTTGT TTTACATAAG CCGGATTACT GGCTCGCCCA TCGCCTGGCA AGTTATTGCA GCCGCCTGAC CCATCCTGAT CGCCCGTTGA CCGGCAGCGG TCCGTTTCGT TTGACGGCAT ACGAGCCTGA TTTGGTCCGG CTGGAGAGCC ACGAGCAGTA CCACCTCAGT CACCCTTTGC TGAAGGCCAT TGAATACTGG ATCACCCCGC AACTTTTTGA ACAGAATTTA GGCACCAGCT GTCGCCACCC CGTGCAAATC GCCATTGGTG AGCCGGATGA ACTGGCCAAT CTGAGTCTGG TCAGCAGCAG TACCAGCCTG GGATTCTGTT ATCTCACGCT GAAACAAAGC CCGCGTCTTA GCCAAATGCA GGCGCAGCGT CTCATCACTA TCATCCATCA CACCACGCTG CTGCATACGC TCCCGCTGGA TGAAAACCTG ATTACGCCTG CACAGGAGTT GCTGCCGGGC TGGACTATTC CCGGCTGGCC AACGCTGAAT ACGGTCGCGC TGCCTGAGAC GCTGACCCTG GTTTATCATT TGCCCGTTGA ACTTCATTCC ATGGCAGAGC AGCTCAAACG TTATCTGGCC AATCTGGGCT GCAATCTGAC GGTCGTTTTT CATGACGCCA AAACCTGGGA CGGATGCAGC GCGCTTGCCG ATGCCGATCT GATGATGGGC GACAGGCTGA TAGGTGAAGC GCCGGGCTAT ACGCTGGAGC AATGGCTCCG CTGCGATACG CTGTGGCCGC ACCTGTTAAG CGCGCCGCAA TATGCTCATC TCCAGGCGAC ACTGGATGCG GTGCAGATAC AGACTGATGA GCAAGCGCGT CACGCGGGGC TGAAAGCCAT CTTCACAAAT CTAATGGAAA ATGCCGTACT GACGCCGCTG TTTAACTATC AATATCAAAT CAGCGCTCCG CCGGGGGTTA ACGGTATCCA CCTTAACACC CGTGGCTGGT TTGATTTCAC GCAGGCGTGG CTTCCGGCGC CAAATACGTG A
|
Protein sequence | MRQLNRLNQY QRLWQPSAGA TQHITVNELA SRCFCSERHV RTLLRQAQDA GWLSWRAQSG RGKRGELTFH VSPDSLRNIM MEEALKSGHQ HNALELAQIA PQELRTLLHP FLGGQWQNDT PTLRIPYYRS LDPLHAGFLP GRAEQHLVGQ VFSGLTRFNG NSSEPTGDLA HHWEVSADGL RWHFYIRSTL HWHNGDKVET AQLQSSLIKL LDLPALRRLF NSVLRIDVTH PQCLTFVLHK PDYWLAHRLA SYCSRLTHPD RPLTGSGPFR LTAYEPDLVR LESHEQYHLS HPLLKAIEYW ITPQLFEQNL GTSCRHPVQI AIGEPDELAN LSLVSSSTSL GFCYLTLKQS PRLSQMQAQR LITIIHHTTL LHTLPLDENL ITPAQELLPG WTIPGWPTLN TVALPETLTL VYHLPVELHS MAEQLKRYLA NLGCNLTVVF HDAKTWDGCS ALADADLMMG DRLIGEAPGY TLEQWLRCDT LWPHLLSAPQ YAHLQATLDA VQIQTDEQAR HAGLKAIFTN LMENAVLTPL FNYQYQISAP PGVNGIHLNT RGWFDFTQAW LPAPNT
|
| |