Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3514 |
Symbol | |
ID | 5588974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3522549 |
End bp | 3524210 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640927141 |
Product | SPFH domain-containing protein/band 7 family protein |
Protein accession | YP_001464511 |
Protein GI | 157157664 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGATA TTGTTAATTC TGTGTCCTCC TGGATGTTTA CCGCGATTAT TGCCGTATGC ATTCTGTTTA TTATTGGAAT TATTTTCGCC AGGCTCTATC GTCGCGCTTC GGCAGAGCAA GCTTTTGTTC GTACTGGTTT AGGTGGGCAA AAAGTGGTGA TGAGCGGTGG CGCAATCGTG ATGCCGATCT TTCATGAAAT AATCCCCATC AATATGAATA CTCTGAAGCT GGAAGTCAGC CGCTCAACCA TTGATAGCCT GATTACGAAA GATCGTATGC GTGTCGATGT AGTAGTCGCT TTCTTTGTAC GGGTAAAACC GTCTGTAGAA GGCATCGCCA CTGCGGCGCA GACGTTGGGG CAACGCACGC TATCACCAGA AGACTTACGT ATGTTGGTTG AAGATAAATT TGTCGATGCC CTCCGTGCAA CAGCTGCACA AATGACCATG CATGAGTTAC AGGATACCCG CGAGAACTTC GTGCAGGGAG TACAAAATAC TGTCGCAGAA GACCTGTCGA AAAACGGTCT TGAGCTGGAA AGCGTTTCAC TTACCAACTT TAACCAGACC TCGAAAGAAC ATTTCAATCC GAACAATGCC TTTGACGCCG AAGGTTTAAC CAAACTGACT CAGGAGACGG AGCGCCGTCG TCGCGAACGT AACGAAGTTG AACAGGATGT AGAAGTTGCG GTGCGTGAGA AAAACCGTGA TGCACTTTCG CGCAAGTTGG AGATTGAACA ACAAGAAGCG TTTATGACGC TTGAGCAGGA GCAGCAGGTT AAAACCCGTA CTGCCGAACA GAATGCACGT ATTGCGGCTT TTGAAGCTGA ACGTCGTCGT GAAGCAGAGC AGACGCGAAT TCTGGCTGAA CGACAAATTC AGGAAACGGA GATCGAGCGC GAGCAGGCCG TCCGCTCAAG AAAGGTTGAG GCTGAACGTG AAGTTCGTAT TAAAGAGATC GAACAACAGC AGGTCACCGA AATCGCCAAC CAGACGAAAT CGATCGCTAT TGCCGCCAAA TCGGAACAAC AGTCCCAGGC AGAAGCGCGT GCTAACCTTG CACTCGCAGA AGCAGTAAGT GCGCAACAAA ACGTAGAAAC CACTCGCCAG ACCGCAGAAG CCGATCGTGC TAAACAAGTT GCCCTAATCG CTGCCGCGCA GGATGCAGAA ACCAAAGCGG TTGAACTGAC CGTGCGGGCG AAAGCAGAAA AAGAAGCCGC AGAGATGCAG GCGGCGGCTA TCGTTGAGTT AGCCGAAGCT ACACGTAAAA AGGGTCTGGC GGAAGCAGAA GCACAACGTG CGCTGAACGA TGCTATCAAC GTACTTTCTG ATGAACAAAC CAGCCTTAAA TTCAAACTGG CCTTGTTGCA GGCGCTACCT GCGGTAATAG AAAAATCCGT TGAGCCGATG AAATCTATTG ACGGTATCAA GATTATTCAG GTCGATGGTC TGAATCGTGG CAGCGCTGCG GGTGATGCAA ACACGGGTAA TGTGGGGGGC GGAAACCTGG CGGAACAAGC ATTATCAGCC GCTCTCTCTT ACCGCACACA GGCACCGCTG ATTGACTCCT TGCTCAATGA AATTGGCGTT TCAGGCGGCT CACTGGCGGC ATTGACTTCA CCCTTAACCT CAACAACTCC CGTCGCCGAA AACGTAGAAT AA
|
Protein sequence | MDDIVNSVSS WMFTAIIAVC ILFIIGIIFA RLYRRASAEQ AFVRTGLGGQ KVVMSGGAIV MPIFHEIIPI NMNTLKLEVS RSTIDSLITK DRMRVDVVVA FFVRVKPSVE GIATAAQTLG QRTLSPEDLR MLVEDKFVDA LRATAAQMTM HELQDTRENF VQGVQNTVAE DLSKNGLELE SVSLTNFNQT SKEHFNPNNA FDAEGLTKLT QETERRRRER NEVEQDVEVA VREKNRDALS RKLEIEQQEA FMTLEQEQQV KTRTAEQNAR IAAFEAERRR EAEQTRILAE RQIQETEIER EQAVRSRKVE AEREVRIKEI EQQQVTEIAN QTKSIAIAAK SEQQSQAEAR ANLALAEAVS AQQNVETTRQ TAEADRAKQV ALIAAAQDAE TKAVELTVRA KAEKEAAEMQ AAAIVELAEA TRKKGLAEAE AQRALNDAIN VLSDEQTSLK FKLALLQALP AVIEKSVEPM KSIDGIKIIQ VDGLNRGSAA GDANTGNVGG GNLAEQALSA ALSYRTQAPL IDSLLNEIGV SGGSLAALTS PLTSTTPVAE NVE
|
| |