Gene Information |
Locus tag | EcE24377A_F0002 |
Symbol | |
ID | 5585660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009786 |
Strand | + |
Start bp | 1352 |
End bp | 3262 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640913725 |
Product | hypothetical protein |
Protein accession | YP_001451375 |
Protein GI | 157149386 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
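The coordinate and length fields above are related by simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1352, 3262)
print(length, protein_length_aa(length))  # prints: 1911 636
```

Under these assumptions the computed values match the Gene Length (1911 bp) and Protein Length (636 aa) fields.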
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000524095 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCATA AAACAGACAC AGCCCCTGTA CAGGAGCAGG CAGGTCTGAC GTTTCGTCTG GAGACCTTTG AATGGCAGGT GCACCAGGGG CTTAACGAAG AGGCGGCCCG GTCCCTGATA TCGCTCTTAC AGTTGCTGGA CCGACATTAT GCGCAGTGGG GGGAGAGCTT TTCCGCCTGG GCGCCGGGGA TGACGGCAGA GGAGATAAAT CCCCATCTGT GCACCCGTAT TGCCGGGGCC ATCACGGCGC TGTTCTCCCG TCCGGGGTTC CGGGTCAGCG ACGGCGGTTT TGCGGAGCTG ATGGACTATC ACCGCTGGCT GGCCATTATT TTTGCCGTCA GCGACTACCG CCACGGCGAC CATATCATCC GCAACATCAA CGCGGCCGGG GGCGGGGTGG TTGCCCCCCT GACCCTGAAC GCGGATAATC TGCAGCTGTT CTGCCTGAGT TATTACCCGG ATTCACAGAT AGCCCTGCAG CCGGAGCCGC TCTGGCAGTA TGACCGACAG ACGGTGGTCC GGCTGTTCTT TGCCCTGCTG AGCGGTCGCG CCCTGCCGAC GCCGGCGGCG CACCAGAAGC GCGAGCATCT CCTGGCGTGG CTGCCGGAGA GGCTGAAGGA GATTGATTCT CTGGAGTTTC TGCCCGGGAA GGTGCTGCAC GATGTTTACA TGCACTGCTC CTATGCAGAT TTACCGGAAA AGCACCGCAT CAAGCAGGAA ATCAACCGGC TGACGGCCCG GGCACTGGAG CAGACTTACG CAGACTGTCT GCCGGTACGC GCGCCGGAAG CGGCGCGTCA GAAACCGGTG CTGGCGGTGG TGCTGGAGTG GTTTACCTGT CAGCACAGCA TTTACCGGAC CCACTCCACC TCCATGCGCG CCCTGCGGGA GCACTTCCAC CTGCTGGGTA TTGCGCAGCC CGGAGCGACG GACGAGATTA CCCGGGAGGT GTTTGATGAG TTCCGGGAGC TGTCGGCGGA GAACGTTGTC GGGGATGCCA TCCGCTGCCT GAGTGAGGTG CGCCCGGACG TGATTTACTA CCCGTCCGTG GGCATGTTCC CGCTGACCGT CTACCTGACG GCCCTGCGCC TGGCTCCGTT GCAGCTGATG GCGCTGGGAC ACCCGGCCAC CACCTGGTCT GAGCATATTG ATGGTGTCCT GGTGGAGGAA GACTACCTGG GAGACCCGGC ATGCTTCAGC GAGACGGTCT GTGCCGTCCC GAAGGATGCG ATACCGTATA TTCCGCCGGC CAGCACGGAA CGTGTCCTGC CGGAACGCAC ACCATTCCGT GACCGGGCGA AGGCGGCGTG GCCTGCGGCC CTGCCGGTGC GGGTGGCTGT CTGTGCATCG GTCATGAAAA TCAACCCGGG CTTCCTGGAT ACCCTGCGGG AAATCAGCGA CAGAAGCCGG GTGCCGGTTC AGTTCTGCTT CTGGATGGGC TTTGCTCAGG GGCTGACGCT GGACTACCTG CGCCGGGCTA TCCGTCAGGC GCTGCCGACG GCAGAAGTGA ATGCGCACAT GCCAGTCCAG GCATACCAGC AGGCGCTGAA CAGCTGTGAG CTGTTTGTGA ACCCGTTCCC GTTTGGCAAC ACCAACGGCC TGGTGGATAC CGTGCGCCAG GGGCTGCCCG GGGTGTGCAT GACGGGGCCG GAAGTCCACA CCCATATTGA TGAGGGGCTG TTCAGACGCC TGGGCCTGCC GGAGGCCCTG ATTGCCCGCG ACCGCGAGGA GTACATCACG GCGGTACTGT CCCTGACGGA GACGCCACGC CTGCGCGAGC GTCTGCAGAA ATACCTGACG 
GAAAACGACG TGGAGAAGGT GCTGTTTGAA GGGCGTCCGG ATAAATTCGC GGAAAGGGTA TGGCAGTTGT GGGAGGCGCG CAGCCATCGT CAGGAGGAGG GTGCCGAATG A
|
Protein sequence | MSHKTDTAPV QEQAGLTFRL ETFEWQVHQG LNEEAARSLI SLLQLLDRHY AQWGESFSAW APGMTAEEIN PHLCTRIAGA ITALFSRPGF RVSDGGFAEL MDYHRWLAII FAVSDYRHGD HIIRNINAAG GGVVAPLTLN ADNLQLFCLS YYPDSQIALQ PEPLWQYDRQ TVVRLFFALL SGRALPTPAA HQKREHLLAW LPERLKEIDS LEFLPGKVLH DVYMHCSYAD LPEKHRIKQE INRLTARALE QTYADCLPVR APEAARQKPV LAVVLEWFTC QHSIYRTHST SMRALREHFH LLGIAQPGAT DEITREVFDE FRELSAENVV GDAIRCLSEV RPDVIYYPSV GMFPLTVYLT ALRLAPLQLM ALGHPATTWS EHIDGVLVEE DYLGDPACFS ETVCAVPKDA IPYIPPASTE RVLPERTPFR DRAKAAWPAA LPVRVAVCAS VMKINPGFLD TLREISDRSR VPVQFCFWMG FAQGLTLDYL RRAIRQALPT AEVNAHMPVQ AYQQALNSCE LFVNPFPFGN TNGLVDTVRQ GLPGVCMTGP EVHTHIDEGL FRRLGLPEAL IARDREEYIT AVLSLTETPR LRERLQKYLT ENDVEKVLFE GRPDKFAERV WQLWEARSHR QEEGAE
|
| |
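The sequences above are shown in space-separated blocks of ten. A minimal sketch of how such a field could be cleaned and sanity-checked follows. The helper names are hypothetical, the GC calculation is the plain (G + C) / length ratio and may not match how the database derived its GC content field, and the stop codon set is the one used by translation table 11.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks from a displayed sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# The last two blocks of the gene sequence field above.
tail = clean_sequence("GTGCCGAATG A")
print(ends_with_stop(tail))  # prints: True
```

To check the full record, pass the complete Gene sequence field to `clean_sequence` and compare `gc_percent` and the sequence length against the GC content and Gene Length fields.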