Gene EcSMS35_A0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0035 
SymboltraB 
ID6106630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp31851 
End bp33278 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content57% 
IMG OID641614782 
Productconjugal transfer pilus assembly protein TraB 
Protein accessionYP_001739923 
Protein GI170650812 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGTA TCAATACCAT TGTGAAACGC AAGCAGTACC TGTGGCTGGG GATTGTGGTT 
GTCGGTACAG CCTCCGCGAT TGGTGGGGCA CTGTATCTGT CTGATGTGGA CATGTCCGGT
AACGGTGAAA CCGTGGCTGA ACAGGAGCCT GTGCCGGATA TGACCGGTGT GGTGGATACG
ACCTTTGATG ACAAGGTGCG TCAGCATGCC ACCACCGAGA TGCAGGTGAC GGCAGCGCAG
ATGCAGAAGC AGTATGAGGA AATCCGTCGT GAGCTGGATG TTCTGAACAA ACAGCGCGGT
GATGACCAGC GTCGTATTGA AAAGCTGGGA CAGGACAATG CCGCCCTGGC AGAGCAGGTA
AAAGCCCTGG GTGCTAATCC CGTCACGGCG ACCGGTGAGC CTGTACCGCA GATGCCTGCC
TCACCGCCCG GCCCGGAAGG CGAACCACAG CCAGGAAACA CCCCCGTATC CTTCCCGCCG
CAGGGCAGCG TTGCTGTTCC ACCGCCGACG GCGTTTTATC CCGGGAATGG TGTCACGCCA
CCACCACAGG TGACGTACCA GTCTGTGCCG GTGCCTAACC GGATACAGCG TAAGGTGTTT
ACCCGTAATG AGGGAAAACA GGGACCATCA CTGCCGTACA TTCCGTCAGG AAGTTTTGCG
AAAGCCATGC TGATTGAAGG GGCGGATGCC AATGCCTCTG TCACCGGTAA TGAATCCACG
GTGCCGATGC AGCTGCGTAT CACCGGCCTG GTGGAAATGC CGAACAGCAA GACGTATGAC
GCAACGGGAT GTTTTGTGGG TCTGGAAGCC TGGGGGGATG TGTCCAGTGA GCGTGCCATT
GTACGCACCC GCAATATCAG TTGCCTGAAG GACGGCAAAA CTATTGATAT GCCGATTAAG
GGGCATGTCA GCTTCCGGGG TAAAAACGGT ATCAAGGGCG AAGTGGTGAT GCGTAACGGC
AAAATCCTCG GCTGGGCATG GGGCGCGGGA TTTGTTGACG GTATCGGTCA GGGAATGGAG
CGCGCCTCCC AGCCGGCTGT CGGGCTGGGT GCCACAGCCG CTTACGGGGC TGGTGATGTC
CTGAAAATGG GTATCGGTGG CGGCGCATCG AAAGCCGCAC AGACGCTCAG TGACTACTAC
ATCAAACGTG CCGAACAGTA TCACCCGGTG ATACCGATTG GTGCGGGCAA CGAAGTGACC
GTGGTGTTCC AGGACGGCTT CCAGCTGAAA ACCGTGGAAG AGATGGCGCT GGAACGCACG
CAGAGCAGAG CGGAAGAAGA CAATCCGGAA AGTCCGGTTC CTGTTCCGCC ATCGGCTGAA
AGTCATCTTA ACGGCTTTAA TACTGACCAG ATGCTGAAGC AGCTGGGCAA CCTGAATCCG
CAGCAGTTTA TGTCCGGCAG CCAGGGAGGG GGCAACGATG GCAAATAA
 
Protein sequence
MASINTIVKR KQYLWLGIVV VGTASAIGGA LYLSDVDMSG NGETVAEQEP VPDMTGVVDT 
TFDDKVRQHA TTEMQVTAAQ MQKQYEEIRR ELDVLNKQRG DDQRRIEKLG QDNAALAEQV
KALGANPVTA TGEPVPQMPA SPPGPEGEPQ PGNTPVSFPP QGSVAVPPPT AFYPGNGVTP
PPQVTYQSVP VPNRIQRKVF TRNEGKQGPS LPYIPSGSFA KAMLIEGADA NASVTGNEST
VPMQLRITGL VEMPNSKTYD ATGCFVGLEA WGDVSSERAI VRTRNISCLK DGKTIDMPIK
GHVSFRGKNG IKGEVVMRNG KILGWAWGAG FVDGIGQGME RASQPAVGLG ATAAYGAGDV
LKMGIGGGAS KAAQTLSDYY IKRAEQYHPV IPIGAGNEVT VVFQDGFQLK TVEEMALERT
QSRAEEDNPE SPVPVPPSAE SHLNGFNTDQ MLKQLGNLNP QQFMSGSQGG GNDGK