Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2989 |
Symbol | |
ID | 6065853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3267154 |
End bp | 3269031 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641602406 |
Product | YVTN beta-propeller repeat-containing protein |
Protein accession | YP_001725941 |
Protein GI | 170020987 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | [TIGR02276] 40-residue YVTN family beta-propeller repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT TTTGCGCTCT GTGTTGGGAT ATTTTGTTAC CTCGCACAAT GGATGAGTTA TGAAGAAGTC GATCAATCCG CACTCATCCA TCTCGGTGCT AACGTTGCTT CACTCTCGTT GTCGGGTGAA CCCTGGCGCT TATTGAGCAG TGTCTTTCTG CACAGTAGTT TTTCCCATTT GCTGATGAAT ATGTTTGCAC TCCTGGTGGT GGGGGCAGTG ACGGAACGGA TACTGGGGAA ATGGCGACTT CTGATTATTT GGTTATTCTC CGGCGTCTTT GGTGGGCTCA TCAGCGCCTG TTATGCGTTA CGCGATAGTG ATCAGATAGT CATCAGCGTT GGGGCATCCG GGGCAATTAT GGGAATAGCT GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTACGGGCA CACACCATAA AAACCAGCGG CGAGTATTTC CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTACGGTGC CCGGCAAACA GGAATAGATA ACGCTTGCCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG AGCGCGCGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA CAGGTCAGGC AAAGCCTGCG TGAAGAGTTT TATCCGCAGG AGATTGAACA AGAGCGACGA CAAAAAAAAC AACAGTTAGC GGAGGAACGC AACGCCCTCA GGGAAACATT ATCCGCTCCG GTAAGTCGTG AACAGGCCAG TGGTGATTTG CTCGCTGAGA TTGCCGATAT CCATGATATG GCGATCAGTC GGGATGGTAA TACGTTGTAT GCCGCAATTG AAAACACCAA CAGCATTGTT GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCCCCCAT AGCGAAAGAA AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GACGCTAAGC CCGGATGAAA CGTTGCTTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC GTGGCGACGG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGTCTTATC CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCG ATTGATCTGG TGACTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGACG GGGACGAGCG GTAAACCTGG TGCCTGGGTT ATGGCACTTT CCCCGGATGA AAAAACATTG TTGATACCCG GTATGGTCAG AGGTGACATT GTACGCATCA ATACCATCAC GCATCAGAAA GAAGACTTTC CCGCAGGTGA TGCGCGTGGA ACGATATCGG CGATGCGTTT TCGACCTGAA AACGGGGATG TAATTTTTGC CGACAGCCTG GGGATTTCAC GTATAAGAGT TGGGGATCAG CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC CCGGACGGTC AGTATTTAGC GTTGGTGTCA TATGGCTTGC AAGGTTATGT CATCCTGCTC AATATTAATG TCGGGCAGAT TGTTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT TTTTCGGCGG ATGGTAGAAA GATATTTGTT ATGGCGAAGA ACGGGTTAAT CCAAATGGAC AGGACGCTCT CGCTTGATCC GCAGGCAGTT ATTCGTCATC CCCAATATGG CAATGTGGCT TGTATCCCTG AACCGTAA
|
Protein sequence | MSASSVKPLN VQLPAITLIL FALCVGIFCY LAQWMSYEEV DQSALIHLGA NVASLSLSGE PWRLLSSVFL HSSFSHLLMN MFALLVVGAV TERILGKWRL LIIWLFSGVF GGLISACYAL RDSDQIVISV GASGAIMGIA GAAIATQLAS GTGTHHKNQR RVFPLLGMVA LTLLYGARQT GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL QVRQSLREEF YPQEIEQERR QKKQQLAEER NALRETLSAP VSREQASGDL LAEIADIHDM AISRDGNTLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLTLS PDETLLYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA IDLVTYQHVA DIPLEKYDGT GTSGKPGAWV MALSPDEKTL LIPGMVRGDI VRINTITHQK EDFPAGDARG TISAMRFRPE NGDVIFADSL GISRIRVGDQ QASIMTQWCS RSVYSVEGIS PDGQYLALVS YGLQGYVILL NINVGQIVGV YPASYVNHLR FSADGRKIFV MAKNGLIQMD RTLSLDPQAV IRHPQYGNVA CIPEP
|
| |