Gene EcSMS35_4114 details


Gene Information

Locus tag: EcSMS35_4114
Symbol:
ID: 6143270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4209396
End bp: 4210892
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 50%
IMG OID: 641618938
Product: regulatory ATPase RavA
Protein accession: YP_001746076
Protein GI: 170679686
COG category: [R] General function prediction only
COG ID: [COG0714] MoxR-like ATPases
TIGRFAM ID:
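
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates (the record does not state this, but the numbers are consistent with it). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4209396
end_bp = 4210892
gene_length_bp = 1497
protein_length_aa = 498

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the final codon is the stop codon
# and does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match Gene Length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match Protein Length
```

All three checks should agree for an unspliced CDS such as this one.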


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.972847
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.113412
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACC CTCATTTATT AGCGGAAAGA ATTTCCCGCC TGAGCAGTTC GTTGGAAAAG 
GGGCTTTATG AACGTAGCCA CGCCATCCGC TTGTGTTTAT TAGCGGCATT AAGTGGTGAA
AGTGTGTTCC TCCTTGGCCC GCCAGGTATT GCCAAAAGTT TGATCGCCCG GCGCTTAAAA
TTCGCCTTTC AGAATGCCCG CGCGTTTGAA TATCTGATGA CTCGCTTCTC CACGCCGGAA
GAGGTTTTTG GTCCCCTTTC TATTCAGGCG CTAAAAGATG AAGGGCGCTA TGAACGTTTA
ACCAGCGGTT ACCTGCCGGA AGCAGAAATC GTCTTTCTGG ATGAGATCTG GAAAGCGGGT
CCGGCAATTC TTAATACCCT GCTCACCGCC ATTAACGAGC GCCAGTTCCG CAACGGTGCA
CACATAGAAA AAATCCCGAT GCGCCTGCTG GTGGCGGCCT CCAACGAGCT GCCGGAAGCA
GACAGCAGTC TGGAAGCGTT ATATGACCGC ATGCTGATTC GTCTGTGGTT AGATAAAGTG
CAGGATAAAG CGAATTTCCG CTCCATGCTG ACCAGTCAAC AGGATGAAAA CGACAATCCG
GTTCCTACCT CCTTGCAGGT CACAGATGAA GAATATGAAC GCTGGCAGAA AGAGATTGGT
GAAATTACGC TGCCCGATCA TGTATTTGAG CTGATTTTTA TGCTGCGCCA GCAACTGGAT
AAATTACCGG ATGCGCCTTA TGTCTCGGAT CGTCGCTGGA AAAAAGCGAT TCGATTATTG
CAGGCCAGCG CCTTTTTTAG CGGTCGCAGT GCTGTTGCCC CGGTTGATCT CATTTTGCTG
AAAGATTGCC TGTGGTATGA CGCGCAAAGC CTGAATTTGA TACAACAACA AATCGATGTA
TTGATGACCG GTCATGCCTG GCAACAGCAA GGGATGTTGA CCCGCCTGGG CGCGATTGTG
CAACGTCACC TGCAACTACA GCAGCAACAA AGCGATAAAA CAGCCTTAAC GGTAATTCGT
CTGGGCGGCA TTTTCAGCCG TCGTCAGCAG TATCAACTCC CTGTTAACGT TACTGCTTCC
ACTCTGACTC TGCTGCTGCA AAAACCGTTA AAACTGCATG ATATGGAAGT GGTTCATATC
ACCTTTGAGC GTAGCGCGCT GGAACAGTGG CTGAGCAAAG GTGGTGAAAT TCGCGGCAAA
CTAAACGGTA TCGGCTTTGC CCAGAAACTG AATCTGGAAG TTGATAGCAC CCAACATCTT
GTTGTACGCG ATGTAAGTTT ACAAGGCAGT ACGCTGGCAC TTCCCGGTTC ATCGGCTGAA
GGTCTGCCAG GTGAAATAAA ACAACAACTG GAAGAGCTTG AAAGCGACTG GCGCAAGCAA
CACGCTTTAT TCAGCGAACA GCAAAAATGT CTGTTTATCC CTGGCGACTG GTTAGGTCGC
ATTGAAGCCA GCCTACAGGA TGTCGGTGCA CAGATTCGCC AGGCGCAACA ATGCTAA
 
Protein sequence
MAHPHLLAER ISRLSSSLEK GLYERSHAIR LCLLAALSGE SVFLLGPPGI AKSLIARRLK 
FAFQNARAFE YLMTRFSTPE EVFGPLSIQA LKDEGRYERL TSGYLPEAEI VFLDEIWKAG
PAILNTLLTA INERQFRNGA HIEKIPMRLL VAASNELPEA DSSLEALYDR MLIRLWLDKV
QDKANFRSML TSQQDENDNP VPTSLQVTDE EYERWQKEIG EITLPDHVFE LIFMLRQQLD
KLPDAPYVSD RRWKKAIRLL QASAFFSGRS AVAPVDLILL KDCLWYDAQS LNLIQQQIDV
LMTGHAWQQQ GMLTRLGAIV QRHLQLQQQQ SDKTALTVIR LGGIFSRRQQ YQLPVNVTAS
TLTLLLQKPL KLHDMEVVHI TFERSALEQW LSKGGEIRGK LNGIGFAQKL NLEVDSTQHL
VVRDVSLQGS TLALPGSSAE GLPGEIKQQL EELESDWRKQ HALFSEQQKC LFIPGDWLGR
IEASLQDVGA QIRQAQQC
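
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence above. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (the two tables differ in which codons may serve as start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed in the record (60 bases, 20 codons).
first_row = "ATGGCTCACC CTCATTTATT AGCGGAAAGA ATTTCCCGCC TGAGCAGTTC GTTGGAAAAG"
print(translate(first_row))  # MAHPHLLAERISRLSSSLEK
```

The output corresponds to the first 20 residues of the protein sequence (MAHPHLLAER ISRLSSSLEK). Passing the full gene sequence to the same function would be expected to stop at the terminal TAA codon.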