Gene Information |
Locus tag | B21_03585 |
Symbol | hsrA |
ID | 8115614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 3827509 |
End bp | 3828936 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644849752 |
Product | hypothetical protein |
Protein accession | YP_003001325 |
Protein GI | 251787021 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
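The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, not something the record states. The function names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 3827509
end_bp = 3828936
reported_gene_length = 1428      # bp
reported_protein_length = 475    # aa


def span_length(start: int, end: int) -> int:
    """Length of the closed interval [start, end] (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one for the stop codon (assumes an unspliced CDS with a stop)."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # consistency of Start/End with Gene Length
print(protein_length == reported_protein_length)  # consistency of Gene Length with Protein Length
```

The strand field does not affect these checks, since the interval length is the same on either strand.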
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000182084 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCTGC GATGGCCTTC TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC CTTGCAGTGA GTCTGTTCAC ATTGGGTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA CAGCTGGTTG TCTTCCGGGT TATTCAGGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT CGACTGGCCT TACTGCGCGC TTATCCTCGT AATGAACTTC TTCCAGTATT GAATTTTGTC GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC AGGCCTTCTT TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC TTTTTGTTGT TTGGCCTCAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGCTGT GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC GGCTGTATGA TGGCGCCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA GTCTTACGTC GTCTGGGCTA TCGCCATACA TTAGTGGGGA TCACGGTGAT TATTGGGCTA ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG TTGTTTATAT TAGGGATGGC TATGTCGACG CAGTTTACCG CGATGAATAC CATCACACTT GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA CTGTCTATCA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGAATG GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACAATGGG CATTATTACT GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
|
Protein sequence | MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA MLIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYEGM EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE
|
| |
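As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 and last 18 nucleotides of the gene sequence shown above. It assumes the sequence is given on the coding strand (it starts with ATG even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which match the standard code. Special handling of alternative start codons is not implemented, which does not matter here because the gene starts with ATG. The `translate` helper is a hypothetical name, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        aa = CODON_TABLE[clean[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
n_terminus = translate("ATGAGCGATA AAAAGAAGCG CAGTATGGCG")
# Last two groups; these begin at base 1411, which is in frame for a 1428 bp CDS.
c_terminus = translate("CCATCAGAAT CGGAGTAA")

print(n_terminus)  # compare with the start of the protein sequence
print(c_terminus)  # compare with the end of the protein sequence
```

The two fragments should match the first ten and last five residues of the protein sequence listed above.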