Gene Information |
Locus tag | EcE24377A_4560 |
Symbol | arpA |
ID | 5587800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4556998 |
End bp | 4558572 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640928178 |
Product | ankyrin repeat-containing protein |
Protein accession | YP_001465510 |
Protein GI | 157156537 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
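The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that `Start bp` and `End bp` are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4556998, 4558572)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed `Gene Length` (1575 bp) and `Protein Length` (524 aa).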
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACTC GTATTCCTCG CAGTTCTTTC TCTGTAAATA TTAATAATAC AGCCCAGACA AATGAACACC AAAACCTGAG TGAATTGTTT TATAAAGAAC TCGAGGATAA ATTTTCTGGC AAGGAGCTGG CGACACCTCT ATTAAAAAGC TTCTCAGAGA ACTGTAGACA AAATGGTCGT CATATCTTTA GCAACAAGGA TTTTGTCATT AAATTTTCCA CGTCCGTCTT ACAAGCTGAT AAGAAAGAAA TTACGATAAT TAATAAAAAC GAAAACACGA CACTTACTCA AACCATTGCC CCAATATTTG AAGAATACCT AATGGAAATT TTACCTCAAC GCTCAGACAC TCTTGATAAA CAAGAATTAA ACCTAAAATC AGATAGAAAA GAAAAAGAAT TCCCAAGAAT TAAACTTAAT GGTCAATGTT ATTTTCCGGG GCGACCCCAA AACCGTATAG TATGCCGACA CATTGCTGCA CAATATATTA ATGATATTTA TCAGAATGTT GATTACAAAC CCCATCAAGA TGATTACTCT TCAGCTGAAA AATTTCTCAC GCACTTCAAC AAAAAATGCA AAAACCAGAC TTTGGCGTTG GTTTCCAGCC GTCCTGAGGG GCGTTGCGTT GCTGCCTGCG GTGATTTCGG GCTAGTTATG AAAGCATATT TTGACAAGAT GGAATCAAAT GGCATCAGTG TTATGGCAGC CATATTACTG GTGGATAACC ATGCTTTGAC GGTCCGGCTA AGAATAAAGA ACACAACTGA AGGATGTACC CATTACGTGG TTTCGGTTTA TGATCCTAAT GTAACTAACG ATAAAATAAG AATTATGAGC GAAAGCAAAG AGGATATTAA ACACTATTCT CTGATGGATT TTATGAATGT AGATTATAGC CTCCTGAAAT GGTCAAATGA TCATGTTATT AACCAATCTG TTGCAATAAT TCCAGCACTT CCGAAAGAAC AGCTATTGAT GTTAAAAGGA TCTGTGGATG AAATAACCCC TCCATTATCA CCAGCAACGA TGAATTTGCT AATGGCAATT GGTCAGAATC ACCAACTTAC GCAACTGATG ATTCAGCTCC AGAAAATGCC AGAACTACAT AGAACAGAAA TGTTGACTGC CTATAATAGT GGACATATGA ACGTTATTAA TACTATTTTT AACGCATTAC CCACTCTGTT TAATACGTTT AAATTCGATA AAAAAAATAT GAAGCCCCTC CTCCTGGCAA ATAATTCTAA TGAATATCCC GGTTTGTTTT CAGCGATACA GCATAAACAA CAAAATGTTG TAGAGACGGT TTATCTTGCT TTATCTGACC ATGCACGCCT GTTTGGATTT ACCGCTGAAG ATATTATGGA TTTTTGGCAA CACAAAGCGC CACAAAAATA CTCTGCCTTT GAGTTGGCTT TTGAATTTGG TCACCGGGTT ATTGCTGAAT TAATCCTTAA TACATTAAAT AAGATGGCTG AAAGCTTTGG CTTTACGGAT AACCCTCGAT ACATTGCGGA GAAAAATTAT ATGGAAGCTT TACTCAAAAA AGCATCTCCC CATACCGTAC GCTAA
|
Protein sequence | MITRIPRSSF SVNINNTAQT NEHQNLSELF YKELEDKFSG KELATPLLKS FSENCRQNGR HIFSNKDFVI KFSTSVLQAD KKEITIINKN ENTTLTQTIA PIFEEYLMEI LPQRSDTLDK QELNLKSDRK EKEFPRIKLN GQCYFPGRPQ NRIVCRHIAA QYINDIYQNV DYKPHQDDYS SAEKFLTHFN KKCKNQTLAL VSSRPEGRCV AACGDFGLVM KAYFDKMESN GISVMAAILL VDNHALTVRL RIKNTTEGCT HYVVSVYDPN VTNDKIRIMS ESKEDIKHYS LMDFMNVDYS LLKWSNDHVI NQSVAIIPAL PKEQLLMLKG SVDEITPPLS PATMNLLMAI GQNHQLTQLM IQLQKMPELH RTEMLTAYNS GHMNVINTIF NALPTLFNTF KFDKKNMKPL LLANNSNEYP GLFSAIQHKQ QNVVETVYLA LSDHARLFGF TAEDIMDFWQ HKAPQKYSAF ELAFEFGHRV IAELILNTLN KMAESFGFTD NPRYIAEKNY MEALLKKASP HTVR
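The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the gene sequence is given 5' to 3' on the coding strand (so no reverse complement is needed despite the `-` strand annotation) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names `translate` and `gc_fraction` are illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGATTACTC GTATTCCTCG CAGTTCTTTC"))
```

Translating the first three blocks of the gene sequence yields the first ten residues of the listed protein, `MITRIPRSSF`. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the `GC content` field.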