Gene EcSMS35_2447 details


Gene Information

Locus tag: EcSMS35_2447
Symbol:
ID: 6146432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2496395
End bp: 2498227
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 52%
IMG OID: 641617319
Product: putative transporter
Protein accession: YP_001744491
Protein GI: 170680266
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID:
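
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2496395
end_bp = 2498227
gene_length_bp = 1833
protein_length_aa = 610


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)       # coordinates vs. Gene Length
print(computed_protein == protein_length_aa)   # Gene Length vs. Protein Length
```

Under these assumptions both comparisons hold for the values listed in this record.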


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.392855
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGTG AATTGATTTG GGTTCTTTCA TTACTGGCCG TTGCCATCGT CTTGTTTGCG 
ACGGGCAGAG TGCGTATGGA TGCGGTCGCT TTGTTTGTTA TTGTCGCGTT TGCATTAAGC
GGAACGCTGA CGGTCCCCGA AGTATTTTCC GGCTTTTCCG ATCCTAACGT TGTCCTGATT
GCCGCCTTGT TCATTATTGG CGATGGTTTG GTCCGTACCG GTGTTGCCAC TGTAATGGGG
ACATGGCTGG TCAAAGTGGC GGGCAACAGT GAAATCAAAA TGTTGGTTTT GTTGATGCTG
ACCGTCGCGG GGCTTGGCGC GTTTATGAGT TCAACCGGCG TTGTCGCTAT CTTTATTCCC
GTGGTGTTAA GCGTTGCCAT GCGTATGCAA ACTTCGCCGT CGCGCCTGAT GATGCCGTTG
AGTTTTGCCG GGCTGATTAG CGGCATGATG ACGCTGGTGG CGACGCCGCC GAACCTGGTA
GTCAACAGTG AATTGCTGCG TGAAGGCTAT CACGGCTTCA GTTTCTTTAG CGTAACACCT
ATTGGCCTGG TCGTGCTGGT GCTGGGTATT TTGTATATGT TAGTGATGCG TTTCATGCTG
AAAGGGGATA CCCAGACCCC GCAGCGCGAA GGCTGGACGC GTCGAACCTT TCGCGATCTT
ATCCGTGAAT ATCGACTGAC CGGGCGTGCG CGTCGTCTGG CTATTCGCCC CGGTTCGCCA
ATGATTGGTC AACGGCTGGA CGATCTCAAA TTACGTGAGC GTTATGGCGC TAACGTCATC
GGTGTTGAAC GCTGGCGGCG TTTTCGTCGC GTTATCGTGA ACGTTAATGG GGTTTCTGAA
TTTCGCGCGC GTGACGTTTT GCTTATTGAT ATGTCTGCGG CTGATGTCGA TCTCCGGCAA
TTTTGTAGTG AGCAATTGCT GGAGCCGATG GTACTGCGCG GCGAGTATTT TTCTGACCAG
GCCCTTGATG TGGGCATGGC AGAGATTTCA TTAATTCCTG AGTCTGAACT TATTGGTAAA
TCGGTGCGCG AAATTGGTTT TCGTACCCGC TACGGACTGA ATGTGGTGGG GCTAAAGCGC
AATGGCGAGG CGATGGAAGG TTCGCTGGCG GATGAGCCTC TGCTGCTGGG CGATATCATC
CTGGTTGTGG GTAACTGGAA ACTGATCGGT ATGCTGGCAA AACAGGGGCG TGACTTTGTG
GCGCTGAACT TACCAGAAGA AGTGAGTGAA GCATCGCCAG CACACAGTCA GGCACCTCAT
GCCATTTTTT GCCTGGTGTT GATGGTGGCG TTAATGCTGA CCGATGAAAT TCCTAATCCT
GTTGCCGCTA TCATCGCCTG CCTGCTGATG GGGAAATTCC GCTGTATAGA TGCTGAAAGC
GCCTATAAAT CCATTCACTG GCCGAGCATT ATTTTGATCG TTGGGATGAT GCCATTTGCT
GTGGCATTAC AGAAAACGGG AGGTGTCGCG CTGGCGGTGA AAGGGCTGAT GGACATTGGC
GGCGGTTATG GGCCACATAT GATGCTGGGG TGTTTGTTTG TCTTGTCGGC GGTTATTGGG
CTATTTATTT CTAATACCGC GACGGCGGTG TTGATGGCTC CGATTGCGCT GGCTGCTGCC
AAAACGATGG GTGTGTCGCC TTATCCATTT GCGATGGTCG TGGCGATGGC AGCATCCGCC
GCCTTTATGA CACCGGTTTC TTCACCGGTT AACACACTGG TTTTAGGCCC GGGAAATTAC
AGCTTCAGTG ACTTTGTGAA GTTGGGGGTA CCGTTCACCA TTATCGTGAT GGCGGTTTGT
GTGGTGATGA TCCCGATGCT GTTTCCGTTT TGA
 
Protein sequence
MNGELIWVLS LLAVAIVLFA TGRVRMDAVA LFVIVAFALS GTLTVPEVFS GFSDPNVVLI 
AALFIIGDGL VRTGVATVMG TWLVKVAGNS EIKMLVLLML TVAGLGAFMS STGVVAIFIP
VVLSVAMRMQ TSPSRLMMPL SFAGLISGMM TLVATPPNLV VNSELLREGY HGFSFFSVTP
IGLVVLVLGI LYMLVMRFML KGDTQTPQRE GWTRRTFRDL IREYRLTGRA RRLAIRPGSP
MIGQRLDDLK LRERYGANVI GVERWRRFRR VIVNVNGVSE FRARDVLLID MSAADVDLRQ
FCSEQLLEPM VLRGEYFSDQ ALDVGMAEIS LIPESELIGK SVREIGFRTR YGLNVVGLKR
NGEAMEGSLA DEPLLLGDII LVVGNWKLIG MLAKQGRDFV ALNLPEEVSE ASPAHSQAPH
AIFCLVLMVA LMLTDEIPNP VAAIIACLLM GKFRCIDAES AYKSIHWPSI ILIVGMMPFA
VALQKTGGVA LAVKGLMDIG GGYGPHMMLG CLFVLSAVIG LFISNTATAV LMAPIALAAA
KTMGVSPYPF AMVVAMAASA AFMTPVSSPV NTLVLGPGNY SFSDFVKLGV PFTIIVMAVC
VVMIPMLFPF
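
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is one of the alternative initiation codons and an initiating codon is read as methionine. The following minimal sketch translates only the first 60 and the last 33 nucleotides of the gene sequence above. The helper `translate` and the `START_CODONS` set are illustrative and not part of the record; the codon table is the standard codon-to-amino-acid mapping, which table 11 shares.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons assumed here for table 11; an initiating codon gives M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_start: bool = True) -> str:
    """Translate a DNA fragment in frame, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 33 nt of the gene sequence in this record.
first_60 = "GTGAACGGTG AATTGATTTG GGTTCTTTCA TTACTGGCCG TTGCCATCGT CTTGTTTGCG"
last_33 = "GTGGTGATGA TCCCGATGCT GTTTCCGTTT TGA"

n_terminus = translate(first_60)                 # fragment starts the CDS
c_terminus = translate(last_33, is_start=False)  # internal fragment, ends at TGA

print(n_terminus)
print(c_terminus)
```

The two fragments can be compared with the first 20 and last 10 residues of the protein sequence listed above.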