Gene EcSMS35_A0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0014 
SymboltraG 
ID6106512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp14829 
End bp17651 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content52% 
IMG OID641614761 
Productconjugal transfer mating pair stabilization protein TraG 
Protein accessionYP_001739902 
Protein GI170650902 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGAAG TTTATGTGAT TGCCGGTGGT GAGTGGTTGC GGAATAACCT GAACGCCATT 
GCCGCCTTTA TGGGAACCAG GACGTGGGAT TCCATTGAAA AAATTGCGCT CACATTGTCT
GTTCTCGCGG TGGCCGTGAT GTGGGTACAG CGGCACAACG TGATGGATTT GCTGGGCTGG
GTGGCCGTGT TTGTGCTTAT CAGCCTGTTG GTTAATGTCC GCACATCGGT GCAGATTATT
GATAACAGTG ACCTGGTCAA AGTTCACCGG GTGGATAATG TACCGGTCGG ACTGGCGATG
CCACTTTCAC TGACGACCCG TATCGGGCAT GCAATGGTGG CCAGTTACGA GATGATCTTC
ACGCAACCGG ACAGTGTCAC CTACAGCAAA ACGGGGATGC TGTTTGGCGC AGAACTGGTA
TCAAAAAGCA CCGACTTTTT GTCCCGCAAT CCGGAAATCG CCAATCTTTT CCAGGATTAT
GTGCAGAACT GCGTGATGGG GGATATTTAC CTGAACCACA AATACACGCT GGAGGAGCTG
ATGGCCTCCG CTGATCCCTA CACGCTGATT TTTTCCCGCC CCAGTCCGCT ACGGGGCGTT
TATGACAGCA ATAATAATTT CGTGACCTGT AAGGACGCGT CGGTTTCGCT GAAAGACAAG
CTGAATCTGG ATACACAGAG TGGCGGAAAA ACCTGGCATT ATTATGCCCA GCAGCTATTT
GGTGGACGCC CTGATCCGAA CCTGTTGTTC AGTACGCTGA TTGGCGACAG TTACAGTTAC
TTCTATGGCT CAAGTAAATC AGCCAGCCAG ATCATCCGCC AGAATGTCAC CATCAATGCG
CTGAAGGAGG GGATCACCAG TTATGCCGCG CGTAATGGTG ATTCAGCCAG CCTGGTGAAT
CTGGCCACCA CGTCATCGAT GGAGAAGCAA CGTCTGGCAC ATGTCTCCAT CGGTCATGTG
GCGATGCGGA CACTGCCCAT GACGCAGACC ATCCTGACGG GGATCGCCAT CGGTATTTTC
CCGCTTCTGG TACTGGCGGC TGTCTTCAAC AAACTGACGC TGTCCGTCCT GAAAGGCTAT
GTGTTTGCCC TGATGTGGTT GCAGTCATGG CCAATGTTGT ACGCCATTCT GAACAGCGCC
ATGACATTTT ATGCGAAGCA GAATGGTGCA CCGGTGGTGC TCTCGGAAAT TTCACAGATA
CAGCTGAAAT ACTCCGATCT GGCTTCCACT GCCGGGTATC TTTCCATGAT GATCCCGCCA
TTATCCTGGA TGATGGTCAG GGGACTCGGG GCCGGATTTT CCAGTGTGTA CAGCCATTTC
GCTTCCTCTG CTATCAGCCC GACTGCCAGT GCGGCAGGTA GTGTGGTTGA CGGTAATTAC
TCCTACGGCA ACATGCAGAC GGAAAACGTG AACGGATTCA GCTGGAGCAC CAACAGCACC
ACGTCGTTTG GTCAGATGAT GTACCAGACC GGCAGTGGCG CAACCGCCAC ACAGACCCGT
GACGGTAATA TGGTGATGGA TGCGAGCGGG GCGATGTCCC GTTTACCGGT CGGTATCAAT
GCAACGCGTC AGATTGCGGC GGCACAACAG GAAATGGCCC GGGAGGCGTC GAACAGAGCA
GAAAGCGCCC TGCATGGGTT CAGCAGCAGT ATTGCCAGTG CCTGGAACTC GCTCAGTCAG
TTTGGTTCTA ACCGGGGGAG CAGTGATTCT GTCACCGGTG GTGCTGACAG CACGATGAGC
GCACAGGACT CCATGATGGC CAGCCGTATG CGCAGCGCAG TGGAAAGCTA TGCGAAGGCG
CATAATATCA GTAATGAGCA GGCGACAAGA GAACTGGCAT CAAGAAGTAC GAGAGGTTCT
GCGGGGATGT ATGGCGATGC TCATGCTGAA TGGGGGGTTA AACCTAAAAT TCTTGGTGTT
GGTGGTGGCG TAGGTGTTAG AGGTGGTGGT CGGGCAAGTA TTGACTGGAG TGATGAGGAT
GCGCATCAGG CCAGCAGTGG TTCACGGGCC AGCCATGATG CTCGCCATGA TATCGATGCC
AGGGCTACAC AAGACTTTAA AGAGGCCAGC GATTACTTTA CCAGTCGCAA GGTCAGTGAA
TCCGGCAGCC ATACGGACAA TAATGCCGAT TCCCGTGTGG ACCAGCTGTC TGCAGCCCTG
AACTCAGCAA AACAGAGTTA CGACCAGTAC ACGACGAATA TGACCCGCAG CCATGAATAT
GCTGAAATGG CATCCCGTAC TGAAAGCATG AGTGGGCAGA TGAGTGAAGA CCTGTCGCAA
CAGTTTGCAC AGTATGTGAT GAAACACGCG CCGCAGGATG CCGAAGCTAT TCTGACAAAC
ACCAGTTCCC CGGAGATTGC AGAACGCCGT CGGGCTATGG CGTGGTCTTT TGTTCAGGAA
CAGGTGCAGC CTGGTGTTGA TAATGCCTGG CGTGAATCCC GTGGGGACAT CGGTAAAGGA
ATGGAGAGCG TACCTTCGGG GGGGGGCAGC CAGGATATTA TTGCTGATCA TCAGGGGCAT
CAGGCCATTA TTGAGCAAAG AACGCAGGAC AGTAATATCC GTAATGATGT AAAACATCAG
GTTGATAATA TGGTCACAGA ATACCGAGGC AATATCGGAG ATACTCAGAG CAGCATACGT
GGCGAAGAAA ATATTGTAGG AAGACAGTAT TCTGAACTAC AAAATCACCA TAAAACAGAG
GCATTATCTC AGAATAATAA ATATAATGAA GAAAAATCGG GTCAGGAAAG AATGCCCGGG
GCTGACAGTC CACAGGAACT GATGAAAAGA GCAAAGGAGT ATCAGGATAA GCACAAGCAA
TAA
 
Protein sequence
MNEVYVIAGG EWLRNNLNAI AAFMGTRTWD SIEKIALTLS VLAVAVMWVQ RHNVMDLLGW 
VAVFVLISLL VNVRTSVQII DNSDLVKVHR VDNVPVGLAM PLSLTTRIGH AMVASYEMIF
TQPDSVTYSK TGMLFGAELV SKSTDFLSRN PEIANLFQDY VQNCVMGDIY LNHKYTLEEL
MASADPYTLI FSRPSPLRGV YDSNNNFVTC KDASVSLKDK LNLDTQSGGK TWHYYAQQLF
GGRPDPNLLF STLIGDSYSY FYGSSKSASQ IIRQNVTINA LKEGITSYAA RNGDSASLVN
LATTSSMEKQ RLAHVSIGHV AMRTLPMTQT ILTGIAIGIF PLLVLAAVFN KLTLSVLKGY
VFALMWLQSW PMLYAILNSA MTFYAKQNGA PVVLSEISQI QLKYSDLAST AGYLSMMIPP
LSWMMVRGLG AGFSSVYSHF ASSAISPTAS AAGSVVDGNY SYGNMQTENV NGFSWSTNST
TSFGQMMYQT GSGATATQTR DGNMVMDASG AMSRLPVGIN ATRQIAAAQQ EMAREASNRA
ESALHGFSSS IASAWNSLSQ FGSNRGSSDS VTGGADSTMS AQDSMMASRM RSAVESYAKA
HNISNEQATR ELASRSTRGS AGMYGDAHAE WGVKPKILGV GGGVGVRGGG RASIDWSDED
AHQASSGSRA SHDARHDIDA RATQDFKEAS DYFTSRKVSE SGSHTDNNAD SRVDQLSAAL
NSAKQSYDQY TTNMTRSHEY AEMASRTESM SGQMSEDLSQ QFAQYVMKHA PQDAEAILTN
TSSPEIAERR RAMAWSFVQE QVQPGVDNAW RESRGDIGKG MESVPSGGGS QDIIADHQGH
QAIIEQRTQD SNIRNDVKHQ VDNMVTEYRG NIGDTQSSIR GEENIVGRQY SELQNHHKTE
ALSQNNKYNE EKSGQERMPG ADSPQELMKR AKEYQDKHKQ