Gene Information |
Locus tag | B21_01191 |
Symbol | ychM |
ID | 8116040 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 1246679 |
End bp | 1248358 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644847443 |
Product | hypothetical protein |
Protein accession | YP_002999016 |
Protein GI | 251784712 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.818983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAACAAAA TATTTTCCTC ACATGTGATG CCTTTCCGCG CTCTGATCGA CGCTTGCTGG AAAGAAAAAT ATACTGCCGC ACGGTTTACC CGTGACCTGA TTGCCGGGAT AACCGTCGGG ATTATTGCTA TCCCGCTGGC GATGGCGTTG GCTATTGGTA GTGGTGTGGC ACCCCAGTAC GGTTTATATA CCGCAGCTGT TGCGGGGATT GTCATTGCTC TGACGGGTGG GTCACGCTTT AGCGTTTCCG GTCCGACTGC GGCATTTGTG GTAATTCTCT ATCCCGTGTC GCAACAGTTT GGACTGGCAG GACTGCTGGT TGCGACCTTG CTGTCGGGGA TCTTTTTGAT TCTGATGGGT CTGGCACGCT TTGGTCGCCT GATTGAGTAT ATTCCGGTTT CCGTCACCTT AGGTTTCACC TCGGGTATCG GGATCACCAT CGGTACCATG CAGATTAAAG ATTTTCTCGG TCTGCAAATG GCCCATGTCC CGGAACATTA TCTACAAAAA GTCGGCGCAT TATTTATGGC GCTGCCGACC ATTAATGTGG GTGATGCTGC CATTGGCATT GTGACGCTAG GTATTCTTGT TTTTTGGCCG CGTCTGGGCA TTCGTTTACC CGGTCACCTT CCGGCCTTGC TGGCTGGTTG CGCGGTGATG GGGATTGTTA ACCTGCTCGG CGGACATGTT GCTACCATCG GTTCGCAATT CCACTACGTC CTGGCCGATG GTTCTCAGGG TAACGGTATT CCGCAACTGC TGCCGCAACT GGTGCTGCCG TGGGATCTGC CTAATTCAGA ATTCACGCTA ACCTGGGATT CTATTCGCAC ACTGCTGCCT GCGGCATTCT CAATGGCAAT GCTCGGCGCA ATCGAATCTC TGCTCTGCGC CGTGGTACTG GATGGTATGA CCGGGACGAA ACACAAGGCG AACAGCGAAC TGGTTGGACA GGGACTGGGG AATATTATCG CTCCGTTCTT TGGTGGTATT ACCGCTACAG CTGCCATCGC GCGTTCTGCC GCTAACGTCC GTGCCGGGGC AACTTCCCCT ATCTCGGCGG TGATCCACTC TATTCTGGTT ATTCTTGCCC TGCTGGTACT GGCACCGCTG CTCTCCTGGC TGCCGCTTTC CGCTATGGCA GCCCTGCTGT TGATGGTGGC GTGGAACATG AGTGAAGCGC ATAAAGTGGT CGACTTGCTG CGTCATGCAC CGAAAGATGA CATCATTGTC ATGCTGCTGT GCATGTCGCT GACCGTGCTG TTTGATATGG TTATTGCCAT CAGCGTGGGG ATCGTGCTGG CATCGCTGCT GTTTATGCGT CGTATCGCAC GTATGACTCG CCTGGCACCG GTAGTCGTAG ATGTTCCAGA CGATGTTCTG GTACTGCGCG TTATTGGCCC GCTGTTTTTT GCTGCTGCTG AAGGCTTGTT CACGGACCTG GAGTCACGTC TTGAAGGCAA ACGGATTGTG ATTCTGAAGT GGGATGCCGT TCCGGTACTT GATGCTGGTG GTCTTGATGC GTTCCAGCGT TTTGTGAAGC GTCTGCCCGA AGGATGTGAA CTGCGCGTGT GCAACGTGGA ATTCCAGCCA CTGCGCACTA TGGCTCGCGC AGGCATTCAA CCGATCCCGG GACGCCTCGC GTTCTTCCCG AATCGTCGCG CGGCGATGGC GGATTTATAA
|
Protein sequence | MNKIFSSHVM PFRALIDACW KEKYTAARFT RDLIAGITVG IIAIPLAMAL AIGSGVAPQY GLYTAAVAGI VIALTGGSRF SVSGPTAAFV VILYPVSQQF GLAGLLVATL LSGIFLILMG LARFGRLIEY IPVSVTLGFT SGIGITIGTM QIKDFLGLQM AHVPEHYLQK VGALFMALPT INVGDAAIGI VTLGILVFWP RLGIRLPGHL PALLAGCAVM GIVNLLGGHV ATIGSQFHYV LADGSQGNGI PQLLPQLVLP WDLPNSEFTL TWDSIRTLLP AAFSMAMLGA IESLLCAVVL DGMTGTKHKA NSELVGQGLG NIIAPFFGGI TATAAIARSA ANVRAGATSP ISAVIHSILV ILALLVLAPL LSWLPLSAMA ALLLMVAWNM SEAHKVVDLL RHAPKDDIIV MLLCMSLTVL FDMVIAISVG IVLASLLFMR RIARMTRLAP VVVDVPDDVL VLRVIGPLFF AAAEGLFTDL ESRLEGKRIV ILKWDAVPVL DAGGLDAFQR FVKRLPEGCE LRVCNVEFQP LRTMARAGIQ PIPGRLAFFP NRRAAMADL
|
| |
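The numeric fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; both assumptions agree with the values listed above but are not stated by the record itself. The function names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record above.
START_BP = 1246679
END_BP = 1248358
GENE_LENGTH_BP = 1680
PROTEIN_LENGTH_AA = 559


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Coordinates and lengths from the record are mutually consistent.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the full Gene sequence field to `gc_percent` as a string (the spaces between 10-base blocks are ignored) gives a value that can be compared with the rounded GC content listed in the record.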