Gene EcSMS35_2328 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2328 
Symbol 
ID6145877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2360931 
End bp2361956 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID641617202 
ProductABC transporter, permease protein 
Protein accessionYP_001744375 
Protein GI170680077 
COG category[R] General function prediction only 
COG ID[COG4239] ABC-type uncharacterized transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0125701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGAC TCAGCCCCGT CAATCAGGCC CGTTGGGCGC GTTTTCGCCA TAACCGTCGC 
GGCTACTGGT CGTTATGGAT TTTCCTCGTC TTGTTTGGTT TGAGTTTGTG TTCTGAACTT
ATCGCCAACG ATAAACCGTT GCTGGTACGT TATGACGGCA GTTGGTATTT CCCGTTGTTG
AAAAACTACA GCGAAAGCGA TTTTGGTGGC CCGCTGGCAA GTCAGGCTGA TTATCAGGAC
CCGTGGCTGA AACAACGGCT GGAAAATAAC GGCTGGGTAC TGTGGGCACC GATTCGCTTT
GGTGCTACCA GTATCAACTT TGCTACCGAT AAGCCCTTCC CTGCTCCACC CTCCCGGCAA
AACTGGCTGG GAACGGATGC CAATGGCGGC GATGTGCTGG CGCGCATTCT CTATGGCACG
CGGATCTCAG TTCTGTTTGG CCTGATGCTG ACCCTCTGCT CCAGCGTGAT GGGCGTGTTG
GCGGGGGCGC TACAAGGCTA TTACGGCGGT AAAGTCGATC TCTGGGGGCA ACGTTTTATT
GAAGTATGGT CGGGGATGCC GACGCTGTTT TTGATTATTT TACTTTCCAG CGTCGTACAG
CCTAACTTCT GGTGGCTGTT GGCAATTACG GTCTTGTTTG GCTGGATGAG TCTGGTCGGC
GTGGTGCGGG CGGAGTTTTT ACGTACCCGT AATTTCGACT ACATCCGTGC GGCGCAGGCG
CTTGGCGTCA GCGATCGCAG TATCATCCTG CGTCATATGT TGCCAAATGC CATGGTCGCT
ACCCTCACCT TTTTACCGTT TATTTTATGT AGTTCGATCA CCACCCTGAC CTCACTCGAT
TTCCTCGGCT TCGGTCTGCC GCTCGGTTCA CCGTCACTCG GCGAACTGCT GTTACAAGGG
AAAAATAACC TTCAGGCCCC GTGGCTTGGG ATCACCGCCT TCTTGTCGGT GGCGATATTG
TTGTCTTTGC TGATCTTTAT TGGTGAAGCC GTCCGCGACG CATTTGATCC TAATAAGGCG
GTGTAG
 
Protein sequence
MPRLSPVNQA RWARFRHNRR GYWSLWIFLV LFGLSLCSEL IANDKPLLVR YDGSWYFPLL 
KNYSESDFGG PLASQADYQD PWLKQRLENN GWVLWAPIRF GATSINFATD KPFPAPPSRQ
NWLGTDANGG DVLARILYGT RISVLFGLML TLCSSVMGVL AGALQGYYGG KVDLWGQRFI
EVWSGMPTLF LIILLSSVVQ PNFWWLLAIT VLFGWMSLVG VVRAEFLRTR NFDYIRAAQA
LGVSDRSIIL RHMLPNAMVA TLTFLPFILC SSITTLTSLD FLGFGLPLGS PSLGELLLQG
KNNLQAPWLG ITAFLSVAIL LSLLIFIGEA VRDAFDPNKA V