Gene EcE24377A_C0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_C0004 
SymboltraH 
ID5585717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009787 
Strand
Start bp11467 
End bp12972 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content51% 
IMG OID640913795 
Productconjugal transfer pilus assembly protein TraH 
Protein accessionYP_001451445 
Protein GI157149420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCAT CACCACGGAG CAGGCCCGCG AAATTGCGAT CCGCTGTCAT GAACGGCAGA 
TTCAGCATCA GCAACGCTGG GTTACCTGAA GGTATTTATT TTCGGACGTT TCCCTCTTTA
TTTACGAAAG GGGGAGCAAT GAAAACTTTC CTGAGACAAT CATTTCTCTC GCTGCTCATT
GGCACTGCGT TGTGCACGTC CGCCAGTGCC GGGCTGCAGG ATGACATGAA TTCGTTTTTT
AACAATATGA GCTATGCCAG CAACGCCACC TCAGCGAAAG CATGGCAGGG ACAGGCGGCA
CGCTATGTAA CCGGCGGTTC GTTCTATGCC CGCACAGGAA ACAAAAATAT CCAGCTGATA
TCCATCAGCC TGCCGTCCAT CAACGCCGGA TGTGGCGGGA TTGATGTCTA TCTTGGGTCT
TTCTCCTTTA TTAACTCCGA CCAGATTATG GCGTTTGTGA AACAGACGAT GGCTAACGCG
GCGGGGTACT TTTTCGACCT TGCACTGGAA ACTACCGTGC CTGAGTTGAA AGCGGCAAAA
GACTTCCTGC AGAAGATGGC TGCTGACCTT AACCGTTTCA ATATGTCCAG CTGTCAGGCG
GCAAAAGCGA TGGTCGACAG CGTGGCGTCG CTGTGGGGGG AAAGTCAGCA GAACGTCTGC
CAGTCTGTCG CCGGTCAGAA TAACGTGTTC TCGGACTGGG TCTCCTCCCG TCAGGGCTGC
ACATCCGGCG GGAAATACGA AAGCGTCACG AACAAGGCTA CCGGCGCAGA AAAAGATCAG
GTCCTGAAGG ATATCAACCT GATGTGGGAT GCTCTCAGCA ACAGTACGCT CAGCAGCAAT
GCAGAGTTAC GCCAGTTTGC CATGAGCATC AGCGGTTCGG TGATTTTCGG CAGTAACGGG
GAAATGCGAA TCCTGTCTTC GCTGGCATCA GACCGCAGCC TGTTGAGTGC GATGATGAGT
GGTGGCAGCG CCAAAGTGTA CGTCTGTGAT AACCAGAACA AATGTCTGTC ACCCTCCCTG
AATAACGTGA CCATTTCGGA GTCAAAATCT CTGATCCGCA TGGTGCGGGA CACGCTGACC
AGCATAGAAA ATAAAGCCAT TACGGACACA CCATTGACGG AGAGAGAGAA GCAGTTCATC
AACAGCACCT CCATTCCCAT CCTGTCCTGG ATAGTGGATC AGTCATCCCT GAGTGTTTCG
CAGTCCCTGT TTGCTCAGCT GACGGATTAC ATCGCCGTCG ATATTTATCT GCAGTATCTG
GAAGCTGTCA TGAAGGTGGT CAATGGTTCA CTGGCTACCA AAGACTATCC GGGGGCCAAT
ATGAATGAAC TGAAAAATGG CCTGGCAGAT GCGCGCCAGG CGCTCAACTC ACTGCGTATG
GAGGTTCAGA TTAAGGAAAA TGCGCTTATT TCTGCACAAC AGCAAATCCG TTTTATCCGC
CAGCAGGTCT CCTCAAAAAT GAGCGATCGC GTACTCGGTA ACTATCAGTT CAGCAGGGTG
AATTAA
 
Protein sequence
MTSSPRSRPA KLRSAVMNGR FSISNAGLPE GIYFRTFPSL FTKGGAMKTF LRQSFLSLLI 
GTALCTSASA GLQDDMNSFF NNMSYASNAT SAKAWQGQAA RYVTGGSFYA RTGNKNIQLI
SISLPSINAG CGGIDVYLGS FSFINSDQIM AFVKQTMANA AGYFFDLALE TTVPELKAAK
DFLQKMAADL NRFNMSSCQA AKAMVDSVAS LWGESQQNVC QSVAGQNNVF SDWVSSRQGC
TSGGKYESVT NKATGAEKDQ VLKDINLMWD ALSNSTLSSN AELRQFAMSI SGSVIFGSNG
EMRILSSLAS DRSLLSAMMS GGSAKVYVCD NQNKCLSPSL NNVTISESKS LIRMVRDTLT
SIENKAITDT PLTEREKQFI NSTSIPILSW IVDQSSLSVS QSLFAQLTDY IAVDIYLQYL
EAVMKVVNGS LATKDYPGAN MNELKNGLAD ARQALNSLRM EVQIKENALI SAQQQIRFIR
QQVSSKMSDR VLGNYQFSRV N