Gene B21_00614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00614 
Symbolybl24 
ID8115652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp647013 
End bp648890 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content50% 
IMG OID644846887 
Producthypothetical protein 
Protein accessionYP_002998460 
Protein GI251784156 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCAT CTTCGGTTAA GCCGTTAAAT GTTCAATTAC CCGCAATAAC CCTTATCCTT 
TTTGCGCTCT GTGTTGGGAT ATTTTGTTAC CTCGCACAAT GGATGAGTTA TGAAGAAGTC
GATCAATCCG CACTCATCCA TCTCGGTGCT AACGTTGCTT CACTCTCGTT GTCGGGTGAA
CCCTGGCGCT TATTGAGCAG TGTCTTTCTG CACAGTAGTT TTTCCCATTT GCTGATGAAT
ATGTTTGCAC TCCTGGTGGT GGGGGCAGTG ACGGAACGGA TACTGGGGAA ATGGCGACTT
CTGATTATTT GGTTATTCTC CGGCGTCTTT GGTGGGCTCA TCAGCGCCTG TTATGCGTTA
CGCGATAGTG ATCAGATAGT CATCAGCGTT GGGGCATCCG GGGCAATTAT GGGAATAGCT
GGCGCTGCGA TAGCAACACA GCTTGCTTCA GGTACGGGCA CACACCATAA AAACCAGCGG
CGAGTATTTC CTCTGTTGGG TATGGTGGCG CTGACACTGT TGTACGGTGC CCGGCAAACA
GGAATAGATA ACGCTTGCCA CATTGGCGGC CTGATTGCGG GTGGCGCGTT GGGTTGGCTG
AGCGCGCGTT TATCTGGGCA AAACCGACTC GTTACGGAAG GCGGGATTAT TGTTGCGGGC
AGTCTTCTTC TGACCGGGGC TATCTGGCTT GCGCAGCAGC AGATGGATGA GTCAGTTTTA
CAGGTCAGGC AAAGCCTGCG TGAAGAGTTT TATCCGCAGG AGATTGAACA AGAGCGACGA
CAAAAAAAAC AACAGTTAGC GGAGGAACGC AACGCCCTCA GGGAAACATT ATCCGCTCCG
GTAAGTCGTG AACAGGCCAG TGGTGATTTG CTCGCTGAGA TTGCCGATAT CCATGATATG
GCGATCAGTC GGGATGGTAA TACGTTGTAT GCCGCAATTG AAAACACCAA CAGCATTGTT
GTTTTCGACC TCGGACAAAA GAAAATCCTG CATACCTTTA CAGCCCCCAT AGCGAAAGAA
AAGTCAGTCA AACATTGTGG TGGCTGTAAA GATCAGGGCG TCAGATCGCT GACGCTAAGC
CCGGATGAAA CGTTGCTTTA TGCGACTTCA TTTGAAGCGA ATGCGTTATC GGTCATTAAC
GTGGCGACGG GGGAGATTAT TCAGTCGATT ACCACCGGTG CACATCCTGA CAGCCTTATC
CTCTCGCGTG ATGGCACAAA AGCCTGGGTG ATGAATCGCA CCAGTAATAG TGTGTCAGCC
ATTGATCTGG TGGCTTATCA GCATGTGGCG GATATCCCGC TGGAGAAATA CGACGGGACG
GGGACGAGTA ATAAACCTGG TGCCTGGGTT ATGGCACTTT CCCCGGATGA AAAAATATTG
TTGATACCCG GTATGGTCAG AGGTGACATT GTACGCATCA ATACCATCAC GCATCAGAAA
GAAGACTTTC CCGCAGGTGA TGCGCGTGGA ACGATATCGG CGATGCGTTT TCGACCTGAA
AACGGGGATG TAATTTTTGC CGACAGCCAG GGGATTTCAC GTATAAGAGT TGGGGATCAG
CAAGCCAGCA TTATGACGCA ATGGTGTAGC AGGAGCGTTT ATTCCGTTGA GGGTATTAGC
CCGGACGGTC AGTATTTAGC GTTGGTGTCA TATGGCTTGC AAGGTTATGT CATCCTGCTC
AATATTAATG TCGGGCAGAT TGTTGGCGTT TATCCTGCCA GCTACGTTAA TCACCTTCGT
TTTTCGGCGG ATGGTAGAAA GATATTTGTT ATGGCGAAGA ACGGGTTAAT CCAAATGGAC
AGGACGCTCT CGCTTGATCC GCAGGCAGTT ATTCGTCATC CCCAATATGG CAATGTGGCT
TGTATCCCTG AACCGTAA
 
Protein sequence
MSASSVKPLN VQLPAITLIL FALCVGIFCY LAQWMSYEEV DQSALIHLGA NVASLSLSGE 
PWRLLSSVFL HSSFSHLLMN MFALLVVGAV TERILGKWRL LIIWLFSGVF GGLISACYAL
RDSDQIVISV GASGAIMGIA GAAIATQLAS GTGTHHKNQR RVFPLLGMVA LTLLYGARQT
GIDNACHIGG LIAGGALGWL SARLSGQNRL VTEGGIIVAG SLLLTGAIWL AQQQMDESVL
QVRQSLREEF YPQEIEQERR QKKQQLAEER NALRETLSAP VSREQASGDL LAEIADIHDM
AISRDGNTLY AAIENTNSIV VFDLGQKKIL HTFTAPIAKE KSVKHCGGCK DQGVRSLTLS
PDETLLYATS FEANALSVIN VATGEIIQSI TTGAHPDSLI LSRDGTKAWV MNRTSNSVSA
IDLVAYQHVA DIPLEKYDGT GTSNKPGAWV MALSPDEKIL LIPGMVRGDI VRINTITHQK
EDFPAGDARG TISAMRFRPE NGDVIFADSQ GISRIRVGDQ QASIMTQWCS RSVYSVEGIS
PDGQYLALVS YGLQGYVILL NINVGQIVGV YPASYVNHLR FSADGRKIFV MAKNGLIQMD
RTLSLDPQAV IRHPQYGNVA CIPEP