Gene EcolC_2989 details

Gene Information

Locus tag: EcolC_2989
Symbol:
ID: 6065853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3267154
End bp: 3269031
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 50%
IMG OID: 641602406
Product: YVTN beta-propeller repeat-containing protein
Protein accession: YP_001725941
Protein GI: 170020987
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID: [TIGR02276] 40-residue YVTN family beta-propeller repeat
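
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch. It assumes the start and end coordinates are 1-based and inclusive, which the record does not state but which is the convention under which the stated lengths agree. It also assumes the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3267154
end_bp = 3269031

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1878
print(remainder)       # 0
print(protein_length)  # 625
```

Under these assumptions the computed values match the reported gene length of 1878 bp and protein length of 625 aa.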


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT 
TTTGCGCTCT GTGTTGGGAT ATTTTGTTAC CTCGCACAAT GGATGAGTTA TGAAGAAGTC
GATCAATCCG CACTCATCCA TCTCGGTGCT AACGTTGCTT CACTCTCGTT GTCGGGTGAA
CCCTGGCGCT TATTGAGCAG TGTCTTTCTG CACAGTAGTT TTTCCCATTT GCTGATGAAT
ATGTTTGCAC TCCTGGTGGT GGGGGCAGTG ACGGAACGGA TACTGGGGAA ATGGCGACTT
CTGATTATTT GGTTATTCTC CGGCGTCTTT GGTGGGCTCA TCAGCGCCTG TTATGCGTTA
CGCGATAGTG ATCAGATAGT CATCAGCGTT GGGGCATCCG GGGCAATTAT GGGAATAGCT
GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTACGGGCA CACACCATAA AAACCAGCGG
CGAGTATTTC CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTACGGTGC CCGGCAAACA
GGAATAGATA ACGCTTGCCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG
AGCGCGCGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC
AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA
CAGGTCAGGC AAAGCCTGCG TGAAGAGTTT TATCCGCAGG AGATTGAACA AGAGCGACGA
CAAAAAAAAC AACAGTTAGC GGAGGAACGC AACGCCCTCA GGGAAACATT ATCCGCTCCG
GTAAGTCGTG AACAGGCCAG TGGTGATTTG CTCGCTGAGA TTGCCGATAT CCATGATATG
GCGATCAGTC GGGATGGTAA TACGTTGTAT GCCGCAATTG AAAACACCAA CAGCATTGTT
GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCCCCCAT AGCGAAAGAA
AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GACGCTAAGC
CCGGATGAAA CGTTGCTTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC
GTGGCGACGG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGTCTTATC
CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCG
ATTGATCTGG TGACTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGACG
GGGACGAGCG GTAAACCTGG TGCCTGGGTT ATGGCACTTT CCCCGGATGA AAAAACATTG
TTGATACCCG GTATGGTCAG AGGTGACATT GTACGCATCA ATACCATCAC GCATCAGAAA
GAAGACTTTC CCGCAGGTGA TGCGCGTGGA ACGATATCGG CGATGCGTTT TCGACCTGAA
AACGGGGATG TAATTTTTGC CGACAGCCTG GGGATTTCAC GTATAAGAGT TGGGGATCAG
CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC
CCGGACGGTC AGTATTTAGC GTTGGTGTCA TATGGCTTGC AAGGTTATGT CATCCTGCTC
AATATTAATG TCGGGCAGAT TGTTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT
TTTTCGGCGG ATGGTAGAAA GATATTTGTT ATGGCGAAGA ACGGGTTAAT CCAAATGGAC
AGGACGCTCT CGCTTGATCC GCAGGCAGTT ATTCGTCATC CCCAATATGG CAATGTGGCT
TGTATCCCTG AACCGTAA
 
Protein sequence
MSASSVKPLN VQLPAITLIL FALCVGIFCY LAQWMSYEEV DQSALIHLGA NVASLSLSGE 
PWRLLSSVFL HSSFSHLLMN MFALLVVGAV TERILGKWRL LIIWLFSGVF GGLISACYAL
RDSDQIVISV GASGAIMGIA GAAIATQLAS GTGTHHKNQR RVFPLLGMVA LTLLYGARQT
GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL
QVRQSLREEF YPQEIEQERR QKKQQLAEER NALRETLSAP VSREQASGDL LAEIADIHDM
AISRDGNTLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLTLS
PDETLLYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA
IDLVTYQHVA DIPLEKYDGT GTSGKPGAWV MALSPDEKTL LIPGMVRGDI VRINTITHQK
EDFPAGDARG TISAMRFRPE NGDVIFADSL GISRIRVGDQ QASIMTQWCS RSVYSVEGIS
PDGQYLALVS YGLQGYVILL NINVGQIVGV YPASYVNHLR FSADGRKIFV MAKNGLIQMD
RTLSLDPQAV IRHPQYGNVA CIPEP
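
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below uses plain Python with no external libraries. It cleans the first line of the gene sequence only, computes its GC fraction, and translates it. The codon assignments are the standard ones, which translation table 11 shares with table 1 for amino acids (the two tables differ in which codons may serve as starts). The names `clean`, `gc_fraction`, and `translate` are illustrative and not part of any database tool.

```python
# First line of the gene sequence, copied from the record (6 blocks of 10 bases).
first_line = "ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT"

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (sequence must be non-empty)."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


dna = clean(first_line)
print(len(dna))          # 60
print(gc_fraction(dna))  # 0.4
print(translate(dna))    # MSASSVKPLNVQLPAITLIL
```

The translated fragment matches the first 20 residues of the protein sequence above. The same functions can be applied to the full gene sequence to compare the result against the reported GC content and protein length.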