Gene Information |
Locus tag | EcSMS35_0522 |
Symbol | |
ID | 6144864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 529768 |
End bp | 531444 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615416 |
Product | putative cation:proton antiport protein |
Protein accession | YP_001742623 |
Protein GI | 170681342 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4651] Kef-type K+ transport system, predicted NAD-binding component |
TIGRFAM ID | [TIGR00932] transporter, monovalent cation:proton antiporter-2 (CPA2) family |
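
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Protein length for a complete CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(529768, 531444)
print(length)                     # 1677
print(protein_length_aa(length))  # 558
```

Both results agree with the Gene Length (1677 bp) and Protein Length (558 aa) fields in the record.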
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.4112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCACG CCACCCCGCT TATCACCACC ATTGTTGGCG GCCTTGTGCT CGCCTTTATC CTCGGCATGC TGGCCAATAA ACTACGTATT TCTCCTCTGG TGGGATATCT GTTAGCGGGT GTGCTGGCAG GACCATTCAC TCCGGGCTTT GTTGCCGATA CCAAGCTTGC GCCGGAACTG GCTGAACTGG GCGTCATTCT GTTGATGTTT GGCGTGGGTC TGCACTTTTC GCTGAAGGAT TTGATGGCGG TAAAGGCCAT CGCCATTCCC GGTGCGATCG CCCAGATAGC CGTGGCGACG CTGCTGGGTA TGGCGCTCTC CGCCGTGCTG GGCTGGTCGT TAATGACCGG TATCGTGTTC GGCTTATGCC TTTCCACCGC CAGTACCGTG GTGTTACTGC GCGCACTTGA AGAACGGCAA TTAATAGACA GTCAGCGTGG GCAAATCGCC ATCGGTTGGT TGATTGTGGA AGACCTGGTA ATGGTTCTGA CGCTGGTGTT GCTGCCCGCA GTGGCAGGAA TGATGGAACA AGGCGATGTG GGCTTTGCCA CTCTTGCTGT CGATATGGGG ATCACCATCG GCAAGGTGAT CGCATTTATC GCCATTATGA TGCTGGTAGG TCGCCGTCTG GTGCCGTGGA TTATGGCACG TAGCGCAGCA ACCGGTTCCC GTGAGCTGTT TACCCTGTCG GTGCTGGCGC TGGCGTTAGG GATTGCCTTT GGTGCGGTGG AGCTGTTTGA TGTCTCCTTT GCACTCGGTG CGTTCTTTGC CGGGATGGTA CTGAACGAGT CTGAACTGAG TCACCGTGCC GCCCACGATA CGCTGCCATT GCGCGACGCG TTTGCGGTGC TGTTTTTTGT CTCCGTCGGG ATGTTGTTTG ATCCGTTAAT TCTGATTCAG CAACCGCTGG CAGTGCTGGC GACGCTGGCG ATTATTCTGT TTGGTAAGTC GTTAGCCGCG TTTTTCCTGG TGCGACTGTT TGGTCACTCC CAACGTACGG CATTAACTAT CGCCGCCAGC CTGGCGCAAA TTGGTGAGTT CGCGTTTATC CTGGCGGGAC TGGGCATGGC GCTGGATCTG TTGCCGCAAG CCGGGCAGAA CCTGGTACTG GCAGGGGCGA TTTTGTCGAT TATGCTCAAC CCGGTCCTGT TTGCGCTGCT GGAGAAATAT CTGGCGAAGA CCGAAACGCT GGAAGAGCAG ACGCTGGAAG AGGCTATCGA AGAAGAGAAG CAGATCCCAG TGGATATTTG CAACCATGCG CTACTGGTAG GTTACGGTCG TGTAGGCAGC CTGCTGGGGG AGAAATTGCT CGCCTCTGAT ATTCCGCTGG TGGTGATTGA GACGTCACGA ACCCGTGTTG ATGAACTGCG AGAGCGCGGG GTCCGCGCAG TATTGGGCAA TGCGGCGAAC GAAGAAATTA TGCAACTGGC GCATCTGGAA TGTGCAAAAT GGCTGATCCT GACGATTCCC AACGGTTATG AAGCGGGCGA GATTGTGGCA TCTGCCCGCG CGAAAAATCC GGATATTGAG ATTATCGCCC GCGCCCATTA TGACGATGAA GTGGCGTATA TCACGGAACG TGGTGCGAAT CAGGTAGTGA TGGGTGAGCG TGAAATTGCT CGTACTATGC TGGAACTGCT GGAAACACCA CCGGCGGGTA AAGTGGTGAC GGGGTAA
|
Protein sequence | MHHATPLITT IVGGLVLAFI LGMLANKLRI SPLVGYLLAG VLAGPFTPGF VADTKLAPEL AELGVILLMF GVGLHFSLKD LMAVKAIAIP GAIAQIAVAT LLGMALSAVL GWSLMTGIVF GLCLSTASTV VLLRALEERQ LIDSQRGQIA IGWLIVEDLV MVLTLVLLPA VAGMMEQGDV GFATLAVDMG ITIGKVIAFI AIMMLVGRRL VPWIMARSAA TGSRELFTLS VLALALGIAF GAVELFDVSF ALGAFFAGMV LNESELSHRA AHDTLPLRDA FAVLFFVSVG MLFDPLILIQ QPLAVLATLA IILFGKSLAA FFLVRLFGHS QRTALTIAAS LAQIGEFAFI LAGLGMALDL LPQAGQNLVL AGAILSIMLN PVLFALLEKY LAKTETLEEQ TLEEAIEEEK QIPVDICNHA LLVGYGRVGS LLGEKLLASD IPLVVIETSR TRVDELRERG VRAVLGNAAN EEIMQLAHLE CAKWLILTIP NGYEAGEIVA SARAKNPDIE IIARAHYDDE VAYITERGAN QVVMGEREIA RTMLELLETP PAGKVVTG
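
The relationship between the gene sequence, the GC content, and the protein sequence can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, and the codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons (this gene starts with ATG). The sequence is expected in the blocked form shown above, with spaces between groups of ten bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGCATCACG CCACCCCGCT TATCACCACC"
print(translate(prefix))  # MHHATPLITT
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of the record.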