Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_2188 |
Symbol | |
ID | 5112880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 2377513 |
End bp | 2378682 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640492375 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001176914 |
Protein GI | 146311840 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000146225 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000569887 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTCGCCGCAG CCTATGGCTG GTATATGGGC CGCAGAAGTG CGCAACAAAC AAAGCAGGAT GAGGCTAACC GGCTGTCCCG TGATTACGTC GCGGGGGTAA ACTTCCTTCT GAGTAACCAA CAGGATAAAG CGGTAGACCT GTTCCTCGAC ATGCTTAAAG AGGACACCGG AACCGTTGAG GCCCATCTCA CTCTCGGAAA CCTGTTCCGC TCGCGTGGCG AAGTTGACCG TGCCATCCGC ATTCACCAGA CGCTGATGGA AAGCGCTTCT CTGACCTACG ATCAGCGTTT GTTAGCTGTT CAGCAGTTAG GCCGTGACTA TATGGCTGCG GGTCTTTATG ACCGTGCTGA AGATATGTTT AGCCAACTGG TCGATGAAAC AGAATTTCGC GTTAGCGCTC TGCAACAACT CCTGCAAATC TATCAGTCAA CCAGCGACTG GCAAAAAGCC ATCGACACCG CCGAGCGCCT GGTAAAACTG GGCAAAGATA AGCAGCGTGT CGAGATTGCG CATTTCTACT GTGAACTTGC CTTGCAGCAG ATGGCCAGCG ATGACATGGA AAAAGCCATG ACATTGCTGA AAAAGGGGGC CTCTGCCGAT CGTAACAGTG CGCGAATCTC CATCATGATG GGGCGCGTGT TTATGGCGAA GGGCGAGTTT GCCAAAGCGG TTGAATGCCT GTTACGCGTC ATCGATCAGG ACAAAGAACT GGTCAGTGAA ACGCTTGAAA TGTTGCAAAC GTGCTATCAA CAACTCGACA AACCGAATGA GTGGGTTGCC TTCTTACGTC GCTGCGTGGA AGAAAACACT GGCGCAATGG CAGAATTAAT GCTGGCGGAC GTGGTTGAGC AACACGAAGG CAACGACACC GCACAGGTCT ACATTACCCG CCAGTTACAG CGCCACCCGA CCATGCGTGT CTTCCACAAA TTGATGGATT ACCACCTTAA CGATGCGGAA GAAGGGCGCG CGAAAGAGAG CCTGATGGTG CTGCGCGACA TGGTGGGTGA GCAGGTGCGC AGTAAGCCGC GCTACCGCTG CCAGAAGTGT GGATTTACCG CGTATACGCT TTACTGGCAT TGTCCTTCAT GTCGCGCGTG GTCAACCATT AAGCCAATCC GCGGTCTGGA TGGGCAGTGA
|
Protein sequence | MLELLFLLLP VAAAYGWYMG RRSAQQTKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYDQRLLAV QQLGRDYMAA GLYDRAEDMF SQLVDETEFR VSALQQLLQI YQSTSDWQKA IDTAERLVKL GKDKQRVEIA HFYCELALQQ MASDDMEKAM TLLKKGASAD RNSARISIMM GRVFMAKGEF AKAVECLLRV IDQDKELVSE TLEMLQTCYQ QLDKPNEWVA FLRRCVEENT GAMAELMLAD VVEQHEGNDT AQVYITRQLQ RHPTMRVFHK LMDYHLNDAE EGRAKESLMV LRDMVGEQVR SKPRYRCQKC GFTAYTLYWH CPSCRAWSTI KPIRGLDGQ
|
| |