Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4048 |
Symbol | |
ID | 6146725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4141377 |
End bp | 4143038 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618874 |
Product | hypothetical protein |
Protein accession | YP_001746012 |
Protein GI | 170683682 |
COG category | [R] General function prediction only |
COG ID | [COG2985] Predicted permease |
TIGRFAM ID | [TIGR01625] AspT/YidE/YbjL antiporter duplication domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.926821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATA TAGCATTAAC GGTCAGTATT CTGGCTTTGG TGGCAGTCGT CGGTTTGTTT ATCGGCAACG TCAAATTTCG CGGCATAGGA TTAGGTATTG GCGGCGTGCT GTTTGGTGGG ATCATCGTCG GCCATTTTGT TTCTCAGGCG GGGATGACAT TAAGTAGCGA TATGCTGCAT GTTATTCAGG AATTTGGCCT GATCCTGTTC GTTTATACCA TCGGGATTCA GGTAGGGCCG GGCTTCTTTG CCTCATTGCG CGTCTCCGGA TTACGCCTCA ACCTGTTTGC TGTTCTGATT GTCATCATCG GGGGTCTGGT TACCGCCATC CTGCATAAAC TGTTTGATAT TCCACTGCCG GTAGTGCTGG GGATTTTCTC CGGTGCGGTA ACCAATACGC CAGCGCTGGG GGCAGGGCAG CAGATCTTGC GCGACCTGGG TACACCAATG GAAATGGTCG ATCAGATGGG GATGAGTTAC GCGATGGCGT ATCCATTCGG TATTTGCGGG ATTTTGTTCA CCATGTGGAT GTTGCGGGTT ATTTTCCGCG TCAATGTCGA GACAGAAGCC CAGCAGCACG AGTCTTCTCG CACCAATGGC GGCGCGCTGA TCAGGACTAT CAATATTCGC GTTGAGAACC CTAACCTGCA TGATTTAGCC ATTAAAGATG TACCGATTCT CAACGGCGAC AAAATTATCT GCTCGCGTCT GAAACGCGAA GAAACCCTAA AAGTACCTTC GCCAGATACC ATTATCCAAC TGGGCGATTT GCTGCATCTG GTGGGGCAGC CAGCGGATTT ACATAATGCG CAACTGGTGA TTGGTCAGGA GGTCGATACC TCGTTGTCCA CGAAAGGCAC TGATTTACGC GTCGAGCGTG TGGTGGTCAC CAATGAAAAC GTGCTCGGTA AACGTATTCG CGACCTGCAC TTTAAAGAAC GCTATGACGT TGTTATCTCG CGCCTGAACC GTGCCGGGGT CGAACTGGTC GCCAGTGGCG ATATCAGCCT GCAGTTCGGC GATATCCTCA ACCTGGTGGG GCGTCCGTCC GCAATTGATG CCGTTGCCAA TGTGCTGGGG AATGCGCAGC AAAAACTGCA ACAGGTTCAG ATGTTGCCGG TGTTTATTGG TATCGGGCTT GGCGTATTGT TAGGTTCTAT TCCCGTCTTT GTGCCGGGAT TCCCGGCCGC GTTGAAACTG GGACTGGCAG GCGGCCCGCT GATTATGGCG TTGATCCTCG GGCGTATCGG CAGTATCGGC AAGCTGTACT GGTTTATGCC GCCAAGCGCC AACCTCGCGC TGCGGGAGCT GGGGATCGTA CTGTTCCTCT CGGTAGTGGG GCTGAAATCT GGTGGGGATT TTGTAAATAC CCTGGTCAAT GGCGAAGGGC TAAGCTGGAT TGGATATGGT GCCCTGATCA CCGCCGTTCC GTTGATTACT GTTGGTATTC TGGCGCGGAT GTTAGCCAAA ATGAATTACC TGACCATGTG CGGGATGCTG GCTGGCTCCA TGACCGATCC ACCGGCGCTG GCATTTGCTA ATAATCTTCA TCCAACGAGC GGTGCAGCGG CGCTCTCTTA CGCCACTGTC TATCCGTTAG TGATGTTCCT GCGCATTATC ACCCCCCAAT TACTGGCGGT GCTCTTCTGG AGTATCGGTT AA
|
Protein sequence | MSDIALTVSI LALVAVVGLF IGNVKFRGIG LGIGGVLFGG IIVGHFVSQA GMTLSSDMLH VIQEFGLILF VYTIGIQVGP GFFASLRVSG LRLNLFAVLI VIIGGLVTAI LHKLFDIPLP VVLGIFSGAV TNTPALGAGQ QILRDLGTPM EMVDQMGMSY AMAYPFGICG ILFTMWMLRV IFRVNVETEA QQHESSRTNG GALIRTINIR VENPNLHDLA IKDVPILNGD KIICSRLKRE ETLKVPSPDT IIQLGDLLHL VGQPADLHNA QLVIGQEVDT SLSTKGTDLR VERVVVTNEN VLGKRIRDLH FKERYDVVIS RLNRAGVELV ASGDISLQFG DILNLVGRPS AIDAVANVLG NAQQKLQQVQ MLPVFIGIGL GVLLGSIPVF VPGFPAALKL GLAGGPLIMA LILGRIGSIG KLYWFMPPSA NLALRELGIV LFLSVVGLKS GGDFVNTLVN GEGLSWIGYG ALITAVPLIT VGILARMLAK MNYLTMCGML AGSMTDPPAL AFANNLHPTS GAAALSYATV YPLVMFLRII TPQLLAVLFW SIG
|
| |