Gene EcSMS35_3701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3701 
SymbolrtcB 
ID6144881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3764385 
End bp3765611 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID641618528 
ProductRtcB protein 
Protein accessionYP_001745668 
Protein GI170679877 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.124467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTACG AATTACTGAC CACTGAAAAT GTCCCGGTAA AAATGTGGAC CAAAGGCGTA 
CCGGTAGAAG ACGATGCGCG TCAGCAACTG ATAAATACGG CGAAGATGCC GTTTATTTTC
AAACATATTG CGGTAATGCC TGATGTACAT CTGGGTAAAG GTTCCACCAT TGGTAGCGTG
ATCCCAACCA AAGGAGCGAT TATTCCGGCG GCGGTGGGGG TAGATATCGG TTGTGGAATG
AACGCGCTGC GTACCGCGTT AACGGCGGCT GACCTGCCTG AAAACCTGGC GGAGCTGCGT
CAGGCGATTG AAGCTGCCGT GCCGCATGGG CGTACCACAG GCCGTTGTAA ACGTGATAAA
GGCGCATGGG AAAAACCGCC TGTTAACGTC GATGCGAAAT GGGCTGAGCT TGAAGCCGGT
TATCAGTGGT TAACGCAAAA ATATCCGCGT TTCCTTAATA CCAATAACTA TAAACACCTG
GGAACGCTGG GAACGGGTAA CCACTTTATT GAAATTTGCC TTGATGAGTC GGATCAGGTG
TGGATTATGC TGCACTCCGG TTCACGCGGA ATTGGTAATG CCATCGGGAC TTACTTTATC
GATCTGGCGC AAAAAGAGAT GCAGGAAACA CTTGAAACGT TGCCATCGCG TGACCTGGCG
TACTTTATGG AAGGTACGGA ATATTTTGAC GATTACCTGA AAGCGGTGGC CTGGGCGCAG
CTTTTTGCCA GCCTTAACCG CGATGCGATG ATGGAAAACG TGGTAACGGC GTTGCAGAGC
GTTACGCAGA AAACGGTAAG ACAGCCACAA ACGCTGGCGA TGGAAGAGAT CAACTGTCAC
CACAACTATG TGCAAAAAGA ACAGCACTTT AGCGAAGAGA TCTATGTGAC GCGTAAAGGC
GCGGTGTCTG CTCGTGCTGG TCAATATGGA ATTATTCCCG GTTCGATGGG GGCGAAAAGC
TTTATTGTCC GTGGGCTGGG AAATGAAGAG TCGTTCTGTT CGTGCAGCCA CGGTGCCGGG
CGGGTAATGA GCCGTACTAA AGCGAAAAAA CTGTTCAGTG TGGAAGACCA AATTCGTGCC
ACCGCGCATG TGGAATGCCG TAAAGATGCC GAAGTGATCG ACGAAATCCC GATGGCGTAT
AAAGATATTG ATGCGGTGAT GGCGGCACAA AGCGATCTGG TGGAAGTTAT CTATACCCTG
CGTCAGGTGG TGTGCGTAAA AGGATAA
 
Protein sequence
MNYELLTTEN VPVKMWTKGV PVEDDARQQL INTAKMPFIF KHIAVMPDVH LGKGSTIGSV 
IPTKGAIIPA AVGVDIGCGM NALRTALTAA DLPENLAELR QAIEAAVPHG RTTGRCKRDK
GAWEKPPVNV DAKWAELEAG YQWLTQKYPR FLNTNNYKHL GTLGTGNHFI EICLDESDQV
WIMLHSGSRG IGNAIGTYFI DLAQKEMQET LETLPSRDLA YFMEGTEYFD DYLKAVAWAQ
LFASLNRDAM MENVVTALQS VTQKTVRQPQ TLAMEEINCH HNYVQKEQHF SEEIYVTRKG
AVSARAGQYG IIPGSMGAKS FIVRGLGNEE SFCSCSHGAG RVMSRTKAKK LFSVEDQIRA
TAHVECRKDA EVIDEIPMAY KDIDAVMAAQ SDLVEVIYTL RQVVCVKG