Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2346 |
Symbol | |
ID | 6065641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2584900 |
End bp | 2586069 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641601749 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001725308 |
Protein GI | 170020354 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000129079 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000238916 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTAGCCGCTG CCTATGGCTG GTATATGGGC CGCAGAAGTG CGCAACAAAA CAAGCAAGAT GAAGCCAACC GCTTGTCGCG TGATTACGTA GCGGGGGTTA ACTTCCTGCT TAGTAATCAA CAGGATAAAG CGGTAGACCT GTTTCTCGAT ATGCTTAAAG AGGATACAGG CACCGTTGAA GCCCACCTTA CGCTCGGAAA CCTGTTCCGT TCGCGTGGCG AAGTTGATCG CGCTATTCGC ATCCATCAGA CCCTAATGGA AAGCGCCTCG CTGACCTATG AACAGCGTCT GTTGGCGATT CAACAACTGG GGCGTGATTA CATGGCCGCC GGGTTATATG ACCGCGCGGA AGACATGTTC AATCAGCTGA CCGATGAAAC TGACTTCCGC ATTGGCGCGC TGCAACAGTT GCTACAAATC TACCAGGCTA CCAGTGAGTG GCAGAAAGCA ATTGATGTTG CCGAACGCCT GGTGAAGCTG GGTAAAGATA AACAGCGCGT CGAAATTGCC CATTTCTACT GTGAGTTAGC CCTGCAGCAT ATGGCCAGCG ACGATCTCGA TCGTGCCATG ACCTTGCTAA AAAAAGGGGC GGCGGCAGAT AAAAACAGCG CCCGCGTATC CATAATGATG GGACGCGTGT TTATGGCGAA AGGAGAATAC GCCAAAGCCG TCGAAAGTCT GCAACGCGTG ATATCCCAGG ACAGAGAACT GGTCAGCGAA ACGCTGGAAA TGTTGCAAAC CTGCTATCAG CAGTTGGGTA AAACTGCCGA ATGGGCAGAA TTCCTGCAGC GTGCGGTGGA AGAGAACACC GGTGCCGATG CTGAATTGAT GCTTGCTGAT ATCATCGAAG CGCGCGACGG TAGTGAGGCC GCACAGGTCT ATATTACGCG CCAGCTTCAG CGTCATCCGA CCATGCGTGT GTTCCATAAG TTAATGGATT ACCACTTAAA TGAAGCGGAA GAAGGGCGTG CCAAAGAGAG CCTGATGGTG CTGCGTGACA TGGTTGGCGA AAAGGTACGT AGTAAGCCTC GTTATCGCTG CCAGAAATGT GGTTTTACCG CATACACCCT CTACTGGCAT TGTCCGTCTT GTCGGGCCTG GTCAACCATT AAACCGATTC GCGGTCTTGA TGGCCTGTAA
|
Protein sequence | MLELLFLLLP VAAAYGWYMG RRSAQQNKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAI QQLGRDYMAA GLYDRAEDMF NQLTDETDFR IGALQQLLQI YQATSEWQKA IDVAERLVKL GKDKQRVEIA HFYCELALQH MASDDLDRAM TLLKKGAAAD KNSARVSIMM GRVFMAKGEY AKAVESLQRV ISQDRELVSE TLEMLQTCYQ QLGKTAEWAE FLQRAVEENT GADAELMLAD IIEARDGSEA AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEKVR SKPRYRCQKC GFTAYTLYWH CPSCRAWSTI KPIRGLDGL
|
| |