Gene EcSMS35_A0015 details


Gene Information

Locus tag: EcSMS35_A0015
Symbol: traH
ID: 6106505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 17648
End bp: 19024
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 52%
IMG OID: 641614762
Product: conjugal transfer pilus assembly protein TraH
Protein accession: YP_001739903
Protein GI: 170650800
COG category:
COG ID:
TIGRFAM ID:
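
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 17648
end_bp = 19024

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1377 bp) and Protein Length (458 aa).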


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.253255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCAC GCATTAAACC TCTTCTGGTT CTTTGTGCTG CTTTACTGAC GGTCACACCC 
GCAGCGTCAG CGGATGTGAA CAGCGACATG AATCAGTTCT TTAACAAGCT GGGCTTCGCC
TCCAATACCA CACAGCCGGG CGTCTGGCAG GGGCAGGCCG CCGGTTATGC TTACGGTGGC
TCCCTGTATG CCCGTACTCA GGTCAAAAAT GTTCAGCTGA TTTCCATGAC GCTGCCGGAT
ATCAATGCCG GATGTGGCGG CATCGACGCC TACCTCGGCT CGTTCAGTTT TATTAATGGC
GAGCAGCTGC AGCGTTTTGT TAAGCAGATT ATGAGCAATG CTGCCGGTTA CTTTTTTGAC
CTTGCCCTGC AGACAACGGT GCCGGAAATC AAAACCGCAA AAGACTTCCT GCAGAAAATG
GCAAGCGACA TTAACAGTAT GAACCTCAGT TCCTGCCAGG CGGCACAGGG GATTATCGGC
GGGCTTTTCC CCCGGACGCA GGTGTCCCAG CAGAAAGTCT GTCAGGACAT TGCCGGTGAG
AGCAATATTT TTGCTGACTG GGCGGCTTCC CGGCAGGGAT GTACCGTTGG CGGGAAATCT
GACAGTGTCA GGGATAAAGC CAGCGACAAG GATAAGGAGC GGGTGACCAA AAACATCAAC
ATCATGTGGA ATGCGCTTTC CAAAAACAGA ATGTTTGACG GCAACAAAGA GCTGAAAGAG
TTTGTGATGA CGCTGACCGG CTCACTGGTG TTTGGTCCTA ACGGCGAAAT CACACCCTTG
TCAGCCAGAA CCACTGACCG CTCAATTATC CGGGCCATGA TGGAAGGCGG CACAGCAAAA
ATTTATCACT GCAACGATTC TGATAAATGC CTGAAAGTGG TGGCAGACAC ACCGGTGACC
ATCAGCCGGG ATAATGCACT GAAGTCTCAG ATTACTAAAC TTCTGGCCAG CATTCAGAAC
AAGGCTGTCA GTGACACGCC TCTGGATGAC AAGGAAAAAG GCTTTATTTC CAGTACCACC
ATCCCCGTCT TCAAATACCT GGTTGACCCG CAGATGCTCG GTGTTTCCAA CAGTATGATT
TACCAGCTGA CGGACTATAT CGGTTACGAC ATCCTGCTGC AGTACATTCA GGAGCTGATA
CAGCAGGCCC GGGCGATGGT GGCCACGGGA AATTATGACG AAGCAGTTAT CGAACATATT
AACGACAACA TGAATGATGC CACCCGGCAG ATTGCGGCGT TTCAGTCGCA GGTGCAGGTA
CAGCAGGATG CGCTGCTGGT TGTCGATCGT CAGATGAGCT ACATGCGTCA GCAGCTTTCC
GCCCGCATGC TCAGTCGTTA CCAGAACAAC TATCACTTCG GAGGGAGCAC GCTGTGA
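
The gene sequence is printed in space-separated groups of ten bases, so the spacing has to be removed before any computation. The sketch below is one possible way to do this and to compute a GC fraction; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first two groups of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Sample input: the first two groups of the gene sequence above.
sample = clean_sequence("ATGATGCCAC GCATTAAACC")
print(len(sample), gc_fraction(sample))
```

Applied to the full 1377 bp sequence, the same function gives a value that can be compared with the GC content field in the Gene Information section.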
 
Protein sequence
MMPRIKPLLV LCAALLTVTP AASADVNSDM NQFFNKLGFA SNTTQPGVWQ GQAAGYAYGG 
SLYARTQVKN VQLISMTLPD INAGCGGIDA YLGSFSFING EQLQRFVKQI MSNAAGYFFD
LALQTTVPEI KTAKDFLQKM ASDINSMNLS SCQAAQGIIG GLFPRTQVSQ QKVCQDIAGE
SNIFADWAAS RQGCTVGGKS DSVRDKASDK DKERVTKNIN IMWNALSKNR MFDGNKELKE
FVMTLTGSLV FGPNGEITPL SARTTDRSII RAMMEGGTAK IYHCNDSDKC LKVVADTPVT
ISRDNALKSQ ITKLLASIQN KAVSDTPLDD KEKGFISSTT IPVFKYLVDP QMLGVSNSMI
YQLTDYIGYD ILLQYIQELI QQARAMVATG NYDEAVIEHI NDNMNDATRQ IAAFQSQVQV
QQDALLVVDR QMSYMRQQLS ARMLSRYQNN YHFGGSTL
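
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11 as listed in the record; that table assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons, which this sketch does not model. The function name `translate` and the constants are illustrative.

```python
# Codon table in the conventional TCAG ordering. The amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the grouped layout used above can be pasted
    in directly. Alternative start codons are not treated specially.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence above.
print(translate("ATGATGCCAC GCATTAAACC TCTTCTGGTT"))
```

The first three groups of the gene sequence translate to the first group of the protein sequence, MMPRIKPLLV.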