Gene B21_03155 details

Gene Information

Locus tag: B21_03155
Symbol: yheS
ID: 8114052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 3340100
End bp: 3342013
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 55%
IMG OID: 644849337
Product: hypothetical protein
Protein accession: YP_003000910
Protein GI: 251786606
COG category: [R] General function prediction only
COG ID: [COG0488] ATPase components of ABC transporters with duplicated ATPase domains
TIGRFAM ID:
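
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3340100
end_bp = 3342013

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values match the reported Gene Length (1914 bp) and Protein Length (637 aa).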


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0843095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGTTT TCTCCTCGTT ACAAATTCGT CGCGGCGTGC GCGTCCTGCT GGATAATGCC 
ACCGCCACCA TCAACCCCGG GCAGAAAGTC GGCCTGGTGG GTAAAAACGG CTGTGGTAAA
TCTACCCTGC TGGCATTGCT GAAAAATGAA ATCAGCGCCG ACGGCGGCAG CTACACCTTT
CCGGGAAGCT GGCAACTGGC GTGGGTGAAT CAGGAAACGC CGGCGTTACC GCAAGCGGCG
CTGGAATATG TCATTGACGG AGACCGTGAA TATCGTCAAC TGGAAGCGCA GCTACACGAC
GCCAACGAAC GTAACGACGG GCACGCCATT GCGACCATTC ATGGCAAGCT GGATGCTATT
GACGCATGGA GTATTCGCTC CCGTGCCGCC AGCCTGCTGC ACGGCCTCGG TTTCAGCAAT
GAACAACTGG AGCGCCCGGT AAGTGATTTC TCCGGTGGCT GGCGTATGCG TCTTAACCTT
GCCCAGGCGC TGATTTGCCG TTCAGACTTG CTGCTGCTCG ACGAACCGAC TAACCACCTC
GATCTCGATG CCGTTATCTG GCTGGAAAAA TGGCTGAAGA GCTATCAGGG CACGCTGATC
CTGATCTCTC ACGACCGCGA CTTCCTCGAT CCGATCGTCG ATAAAATTAT TCATATCGAA
CAACAAAGCA TGTTCGAGTA CACCGGCAAC TACAGTTCGT TTGAAGTACA GCGCGCCACC
CGTCTGGCGC AGCAACAAGC GATGTACGAA AGCCAGCAGG AACGCGTAGC GCATCTGCAA
AGTTATATCG ACCGTTTCCG TGCCAAAGCC ACCAAAGCGA AGCAGGCCCA GAGCCGCATT
AAGATGCTCG AGCGTATGGA GCTAATTGCC CCCGCGCACG TCGACAACCC GTTCCGCTTT
AGCTTCCGCG CGCCGGAAAG CCTGCCAAAT CCGTTACTGA AGATGGAAAA AGTCAGCGCG
GGCTATGGCG ATCGCATTAT TCTCGACTCG ATTAAACTGA ACCTGGTGCC CGGCTCGCGT
ATTGGTCTGT TAGGCCGCAA TGGCGCGGGT AAATCGACAT TAATCAAACT GTTAGCCGGT
GAACTTGCGC CAGTCAGCGG TGAAATTGGT CTGGCGAAAG GGATCAAACT CGGCTACTTC
GCCCAGCATC AACTTGAATA CCTGCGCGCC GACGAATCAC CTATTCAACA TCTGGCACGT
TTAGCGCCGC AGGAGCTGGA ACAAAAACTG CGTGACTACC TCGGCGGCTT TGGTTTCCAG
GGCGATAAAG TAACCGAAGA AACGCGCCGC TTCTCCGGTG GGGAAAAAGC CCGCCTGGTG
CTGGCATTAA TTGTCTGGCA GCGGCCGAAT CTGCTGCTGC TCGACGAACC GACTAACCAC
CTTGACCTCG ACATGCGTCA GGCACTCACC GAAGCATTAA TCGAGTTTGA AGGCGCGCTG
GTTGTCGTTT CGCACGACCG TCATTTGCTG CGTTCCACCA CTGACGATCT CTACCTGGTT
CACGATCGTA AAGTCGAACC GTTCGACGGC GATCTGGAAG ATTATCAACA GTGGTTGAGC
GACGTACAAA AGCAGGAAAA CCAGACCGAC GAAGCGCCAA AAGAGAATGC GAACAGCGCC
CAGGCACGTA AAGATCAGAA GCGTCGGGAA GCTGAGCTGC GTGCGCAAAC CCAGCCACTG
CGTAAAGAGA TTGCCCGTCT GGAAAAAGAG ATGGAGAAGC TGAACGCGCA ACTGGCGCAG
GCGGAAGAGA AACTCGGCGA CAGCGAACTG TATGACCAGA GCCGTAAAGC GGAGTTGACC
GCCTGCCTGC AACAGCAAGC CAGCGCCAAA TCCGGCCTGG AAGAGTGCGA AATGGCATGG
CTGGAAGCCC AGGAGCAGCT TGAGCAGATG CTGCTGGAAG GCCAAAGCAA CTGA
 
Protein sequence
MIVFSSLQIR RGVRVLLDNA TATINPGQKV GLVGKNGCGK STLLALLKNE ISADGGSYTF 
PGSWQLAWVN QETPALPQAA LEYVIDGDRE YRQLEAQLHD ANERNDGHAI ATIHGKLDAI
DAWSIRSRAA SLLHGLGFSN EQLERPVSDF SGGWRMRLNL AQALICRSDL LLLDEPTNHL
DLDAVIWLEK WLKSYQGTLI LISHDRDFLD PIVDKIIHIE QQSMFEYTGN YSSFEVQRAT
RLAQQQAMYE SQQERVAHLQ SYIDRFRAKA TKAKQAQSRI KMLERMELIA PAHVDNPFRF
SFRAPESLPN PLLKMEKVSA GYGDRIILDS IKLNLVPGSR IGLLGRNGAG KSTLIKLLAG
ELAPVSGEIG LAKGIKLGYF AQHQLEYLRA DESPIQHLAR LAPQELEQKL RDYLGGFGFQ
GDKVTEETRR FSGGEKARLV LALIVWQRPN LLLLDEPTNH LDLDMRQALT EALIEFEGAL
VVVSHDRHLL RSTTDDLYLV HDRKVEPFDG DLEDYQQWLS DVQKQENQTD EAPKENANSA
QARKDQKRRE AELRAQTQPL RKEIARLEKE MEKLNAQLAQ AEEKLGDSEL YDQSRKAELT
ACLQQQASAK SGLEECEMAW LEAQEQLEQM LLEGQSN
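
The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record (trailing space kept).
first_line = "ATGATTGTTT TCTCCTCGTT ACAAATTCGT CGCGGCGTGC GCGTCCTGCT GGATAATGCC "
seq = clean_sequence(first_line)

print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.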