Gene EcSMS35_A0022 details


Gene Information

Locus tag: EcSMS35_A0022
Symbol: traN
ID: 6106499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 21631
End bp: 23439
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 52%
IMG OID: 641614769
Product: conjugal transfer mating pair stabilization protein TraN
Protein accession: YP_001739910
Protein GI: 170650771
COG category:
COG ID:
TIGRFAM ID: [TIGR02750] type-F conjugative transfer system mating-pair stabilization protein TraN
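
The start, end, gene length and protein length fields of this record are related by simple arithmetic. The minimal sketch below checks that they agree. It assumes that the coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither assumption is stated in the record, and the function name `check_record` is illustrative only.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length includes the 3 bp stop codon.
    """
    span = end_bp - start_bp + 1              # bases covered on the replicon
    coding_bp = (protein_length_aa + 1) * 3   # residues plus one stop codon
    return span == gene_length_bp == coding_bp


# Values copied from the Gene Information fields of this record.
print(check_record(21631, 23439, 1809, 602))  # True
```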


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTA TTTTACCTCT GATACTGGCT CTGGTTGCCG GCATGGCACT GGCTGACAGC 
AACAGTGATT ACCGGGCCGG CTCTGATTTT GCTCACCAGA TCAAAGGACA GGGAAGCAGC
AGTATTCAGG GCTTTAAGCC ACAGGAGAGT ATCCCCGGCT ATAACGCGAA TCCGGACGAG
ACAAAATACT ACGGTGGCGT GACTGCCGGC GGGGATGGCG GCCTGAAAAA TGACGGCACC
ACGGAATGGG CGACCGGTGA AACCGGAAAA ACCATCACAG AGTCATTCAT GAACAAGCCG
AAAGACATTC TTTCACCGGA TGCACCGTTT ATCCAGACCG GCAGGGATGT GGTGAACCGG
GCTGACAGCA TTGTGGGAAA CACCGGGCAG CAGTGCAGTG CGCAGGAGAT TAGCCGGAGT
GAATACACGA ATTACACCTG TGAGCGGGAT TTACAGGTGG AGCAGTACTG CACACGAACT
GCCCGTATGG AGCTGCAGGG GAGTACCACA TGGGAAACCC GGACGCTGGA GTATGAAATG
AGTCAGCTAC CTGCACGTGA AGTGAATGGT CAGTATGTTG TCTCCATTAC CTCTCCCGTT
ACCGGTGAAA TTGTCGATGC GCATTACAGC TGGAGTCGTA CTTACCTGCA GAAGAGTGTA
CCTATGACAA TTACAGTACT GGGAACCCCA CTGAGCTGGA ATGCCAAATA CTCAGCAGAT
GCCTCATTTA CACCAGTACA GAAAACACTG ACTGCCGGTG TTGCTTTTAC GTCATCTCAC
CCCGTCCGGG TCGGGAATAC AAAATTCAGA CGTCATACGG CAATGAAGCT GCGTCTGGTT
GTCAGGGTAA AAAAAGCCTC GTACACCCCG TATGTTGTCT GGTCTGAAAG CTGCCCGTTC
AGCAAGGAGC TGGGGAAACT GACAAAAACA GAATGTACGG AGGCTGGCGG GAACAGGACG
CTGGTGAAAG ACGGTCAGTC ATACAGCATG TACCAGAGCT GCTGGGCATA CCGGGACACG
TATGTCACAC AGTCAGCAGA CAAGGGAACC TGTCAGACCT ACACCGACAA TCCTGCCTGT
ACCCTGGTGT CACACCAGTG CGCCTTTTAC TCCGAAGAAG GTGCCTGTCT GCATGAATAT
GCCACGTACT CCTGTGAGTC AAAAACATCC GGGAAAGTGA TGGTCTGTGG GGGAGACGTC
TTCTGTCTCG ATGGCGAATG CGATAAGGCT CAGAGCGGGA AGAGTAATGA TTTTGCCGAA
GCCGTTTCTC AGCTGGCGGC ACTGGCCGCA GCAGGTAAGG ATGTTGCGGC GCTGAACGGG
GTGGATGTCC GTGCTTTCAC CGGTCAGGCG AAATTTTGTA AGAAGGCTGC AGCCGGATAC
AGCAACTGCT GTAAGGACAG CGGCTGGGGA CAGGATATCG GGCTGGCCAA ATGCAGCAGT
GATGAAAAAG CCCTGGCTAA AGCAAAATCA AACAAACTGA CCGTCAGTGT CGGAGAGTTC
TGTTCGAAGA AAGTGCTGGG TGTCTGCCTG GAGAAAAAAC GCAGTTACTG TCAGTTTGAT
TCGAAGCTGG CGCAGATAGT CCAGCAGCAG GGGCGCAACG GACAGCTGCG TATCAGTTTT
GGCAGTGCGA AGCATCCTGA CTGTCGGGGG ATTACGGTTG ATGAGCTGCA GAAAATTCAG
TTCGACAGAC TGGATTTCAC TAACTTCTAC GAAGACCTGA TGAATAACCA GAAAATCCCG
GACAGTGGTG TTCTGACGCA GAAAGTGAAA GAGCAGATTG CTGACCAGCT GAAACAGGCA
GGACAGTAA
 
Protein sequence
MKRILPLILA LVAGMALADS NSDYRAGSDF AHQIKGQGSS SIQGFKPQES IPGYNANPDE 
TKYYGGVTAG GDGGLKNDGT TEWATGETGK TITESFMNKP KDILSPDAPF IQTGRDVVNR
ADSIVGNTGQ QCSAQEISRS EYTNYTCERD LQVEQYCTRT ARMELQGSTT WETRTLEYEM
SQLPAREVNG QYVVSITSPV TGEIVDAHYS WSRTYLQKSV PMTITVLGTP LSWNAKYSAD
ASFTPVQKTL TAGVAFTSSH PVRVGNTKFR RHTAMKLRLV VRVKKASYTP YVVWSESCPF
SKELGKLTKT ECTEAGGNRT LVKDGQSYSM YQSCWAYRDT YVTQSADKGT CQTYTDNPAC
TLVSHQCAFY SEEGACLHEY ATYSCESKTS GKVMVCGGDV FCLDGECDKA QSGKSNDFAE
AVSQLAALAA AGKDVAALNG VDVRAFTGQA KFCKKAAAGY SNCCKDSGWG QDIGLAKCSS
DEKALAKAKS NKLTVSVGEF CSKKVLGVCL EKKRSYCQFD SKLAQIVQQQ GRNGQLRISF
GSAKHPDCRG ITVDELQKIQ FDRLDFTNFY EDLMNNQKIP DSGVLTQKVK EQIADQLKQA
GQ
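
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips that layout, computes a GC fraction, and translates codons up to the first stop. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code; the alternative start codons that table 11 allows are not handled, which is harmless here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG order (shared by translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
print(translate(clean("ATGAAACGTA TTTTACCTCT GATACTGGCT")))  # MKRILPLILA
print(translate(clean("GGACAGTAA")))                         # GQ
```

Passing the full gene sequence through `clean` and then `gc_fraction` or `translate` should allow a comparison with the GC content and protein sequence listed in this record.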