Gene Information |
Locus tag | EcSMS35_A0015 |
Symbol | traH |
ID | 6106505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | - |
Start bp | 17648 |
End bp | 19024 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641614762 |
Product | conjugal transfer pilus assembly protein TraH |
Protein accession | YP_001739903 |
Protein GI | 170650800 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.253255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGCCAC GCATTAAACC TCTTCTGGTT CTTTGTGCTG CTTTACTGAC GGTCACACCC GCAGCGTCAG CGGATGTGAA CAGCGACATG AATCAGTTCT TTAACAAGCT GGGCTTCGCC TCCAATACCA CACAGCCGGG CGTCTGGCAG GGGCAGGCCG CCGGTTATGC TTACGGTGGC TCCCTGTATG CCCGTACTCA GGTCAAAAAT GTTCAGCTGA TTTCCATGAC GCTGCCGGAT ATCAATGCCG GATGTGGCGG CATCGACGCC TACCTCGGCT CGTTCAGTTT TATTAATGGC GAGCAGCTGC AGCGTTTTGT TAAGCAGATT ATGAGCAATG CTGCCGGTTA CTTTTTTGAC CTTGCCCTGC AGACAACGGT GCCGGAAATC AAAACCGCAA AAGACTTCCT GCAGAAAATG GCAAGCGACA TTAACAGTAT GAACCTCAGT TCCTGCCAGG CGGCACAGGG GATTATCGGC GGGCTTTTCC CCCGGACGCA GGTGTCCCAG CAGAAAGTCT GTCAGGACAT TGCCGGTGAG AGCAATATTT TTGCTGACTG GGCGGCTTCC CGGCAGGGAT GTACCGTTGG CGGGAAATCT GACAGTGTCA GGGATAAAGC CAGCGACAAG GATAAGGAGC GGGTGACCAA AAACATCAAC ATCATGTGGA ATGCGCTTTC CAAAAACAGA ATGTTTGACG GCAACAAAGA GCTGAAAGAG TTTGTGATGA CGCTGACCGG CTCACTGGTG TTTGGTCCTA ACGGCGAAAT CACACCCTTG TCAGCCAGAA CCACTGACCG CTCAATTATC CGGGCCATGA TGGAAGGCGG CACAGCAAAA ATTTATCACT GCAACGATTC TGATAAATGC CTGAAAGTGG TGGCAGACAC ACCGGTGACC ATCAGCCGGG ATAATGCACT GAAGTCTCAG ATTACTAAAC TTCTGGCCAG CATTCAGAAC AAGGCTGTCA GTGACACGCC TCTGGATGAC AAGGAAAAAG GCTTTATTTC CAGTACCACC ATCCCCGTCT TCAAATACCT GGTTGACCCG CAGATGCTCG GTGTTTCCAA CAGTATGATT TACCAGCTGA CGGACTATAT CGGTTACGAC ATCCTGCTGC AGTACATTCA GGAGCTGATA CAGCAGGCCC GGGCGATGGT GGCCACGGGA AATTATGACG AAGCAGTTAT CGAACATATT AACGACAACA TGAATGATGC CACCCGGCAG ATTGCGGCGT TTCAGTCGCA GGTGCAGGTA CAGCAGGATG CGCTGCTGGT TGTCGATCGT CAGATGAGCT ACATGCGTCA GCAGCTTTCC GCCCGCATGC TCAGTCGTTA CCAGAACAAC TATCACTTCG GAGGGAGCAC GCTGTGA
Protein sequence | MMPRIKPLLV LCAALLTVTP AASADVNSDM NQFFNKLGFA SNTTQPGVWQ GQAAGYAYGG SLYARTQVKN VQLISMTLPD INAGCGGIDA YLGSFSFING EQLQRFVKQI MSNAAGYFFD LALQTTVPEI KTAKDFLQKM ASDINSMNLS SCQAAQGIIG GLFPRTQVSQ QKVCQDIAGE SNIFADWAAS RQGCTVGGKS DSVRDKASDK DKERVTKNIN IMWNALSKNR MFDGNKELKE FVMTLTGSLV FGPNGEITPL SARTTDRSII RAMMEGGTAK IYHCNDSDKC LKVVADTPVT ISRDNALKSQ ITKLLASIQN KAVSDTPLDD KEKGFISSTT IPVFKYLVDP QMLGVSNSMI YQLTDYIGYD ILLQYIQELI QQARAMVATG NYDEAVIEHI NDNMNDATRQ IAAFQSQVQV QQDALLVVDR QMSYMRQQLS ARMLSRYQNN YHFGGSTL
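The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is not part of the source database: the helper names (`clean_sequence`, `span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical. It assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon, which is consistent with the values listed above (1377 bp, 458 aa) but is not stated by the record itself.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace that splits the displayed sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(17648, 19024)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1377 458
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare against the listed GC content of 52%; the function removes the block spacing itself, so the field can be pasted in as displayed.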