Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1471 |
Symbol | |
ID | 6067225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 1622892 |
End bp | 1624448 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641600891 |
Product | hypothetical protein |
Protein accession | YP_001724461 |
Protein GI | 170019507 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2200] FOG: EAL domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000861353 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0219942 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCATAC GCGCTCCCAA TTTTGGACGT AAGCTCCTGC TTACCTGCAT TGTTGCAGGC GTAATGATTG CGATACTGGT GAGTTGCCTT CAGTTTTTAG TGGCCTGGCA TAAGCACGAA GTCAAATACG ACACACTGAT TACCGACGTA CAAAAGTATC TCGATACCTA TTTTGCCGAC CTGAAATCCA CTACTGACCG GCTCCAGCCG CTGACCTTAG ATACCTGCCA GCAAGCTAAC CCCGAACTGA CCGCCCGCGC AGCGTTTAGC ATGAATGTCC GAACGTTTGT GCTGGTGAAA GATAAAAAAA CATTCTGTTC ATCTGCGACC GGTGAGATGG ACATTCCACT CAATGAATTG ATTCCGGCGC TCGACATTAA TAAAAACGTC GATATGGCGA TCTTACCCGG CACGCCGATG GTGCCGAACA AACCCGCAAT CGTCATCTGG TATCGCAACC CTTTGCTGAA AAATAGCGGC GTCTTTGCCG CTCTGAATCT CAACCTGACG CCTTCACTCT TTTATAGTTC ACGGCAGGAA GATTACGATG GCGTCGCCCT CATTATTGGC AATACTGCGC TATCTACCTT TTCTTCACGT TTGATGAACG TTAACGAATT AACCGACATG CCAGTCCGTG AAACTAAAAT TGCGGGCATT CCTCTGACCG TTCGGCTTTA TGCAGATGAC TGGACATGGA ACGATGTGTG GTACGCATTT TTACTGGGCG GCATGAGTGG AACTGTCGTT GGCCTGCTCT GCTATTACCT GATGAGCGTA CGTATGCGCC CCGGCAGAGA AATCATGACC GCCATCAAGC GCGAACAATT TTACGTGGCG TATCAACCGG TGGTGGATAC ACAAGCTTTG CGAGTAACGG GCCTGGAAGT ACTGCTACGC TGGCGGCATC CTGTCGCGGG AGAAATTCCC CCGGATGCCT TCATTAACTT TGCCGAATCG CAAAAGATGA TTGTGCCGCT GACTCAGCAC CTGTTTGAGT TAATTGCCCG CGATGCCGCA GAATTAGAAA AAGTGCTGCC GGTAGGCGTC AAATTTGGTA TTAACATTGC GCCGGACCAT CTGCACAGCG AAAGCTTTAA AGCAGATATC CAGAAACTGC TCACTTCCCT GCCCGCACAC CATTTCCAGA TTGTGCTGGA AATTACCGAG CGCGATATGT TGAAAGAGCA AGAAGCCACA CAACTCTTCG CCTGGCTGCA CTCGGTCGGC GTAGAAATTG CTATTGATGA CTTCGGCACC GGGCACAGCG CGCTTATCTA TCTTGAGCGT TTTACGCTCG ATTATCTGAA AATTGACCGT GGATTTATCA ACGCCATCGG TACGGAAACG ATCACTTCAC CCGTACTTGA CGCGGTGCTG ACGCTGGCGA AACGCCTCAA TATGCTGACG GTTGCTGAAG GGGTCGAAAC GCCGGAACAG GCGCGATGGC TAAGCGAACG CGGCGTTAAT TTCATGCAAG GCTACTGGAT TAGTCGCCCG TTACCGCTGG ACGATTTTGT TCGCTGGCTG AAGAAACCGT ATACGCCGCA GTGGTAA
|
Protein sequence | MFIRAPNFGR KLLLTCIVAG VMIAILVSCL QFLVAWHKHE VKYDTLITDV QKYLDTYFAD LKSTTDRLQP LTLDTCQQAN PELTARAAFS MNVRTFVLVK DKKTFCSSAT GEMDIPLNEL IPALDINKNV DMAILPGTPM VPNKPAIVIW YRNPLLKNSG VFAALNLNLT PSLFYSSRQE DYDGVALIIG NTALSTFSSR LMNVNELTDM PVRETKIAGI PLTVRLYADD WTWNDVWYAF LLGGMSGTVV GLLCYYLMSV RMRPGREIMT AIKREQFYVA YQPVVDTQAL RVTGLEVLLR WRHPVAGEIP PDAFINFAES QKMIVPLTQH LFELIARDAA ELEKVLPVGV KFGINIAPDH LHSESFKADI QKLLTSLPAH HFQIVLEITE RDMLKEQEAT QLFAWLHSVG VEIAIDDFGT GHSALIYLER FTLDYLKIDR GFINAIGTET ITSPVLDAVL TLAKRLNMLT VAEGVETPEQ ARWLSERGVN FMQGYWISRP LPLDDFVRWL KKPYTPQW
|
| |