Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3709 |
Symbol | |
ID | 5588536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3704999 |
End bp | 3706321 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640927332 |
Product | putative cryptic C4-dicarboxylate transporter DcuD |
Protein accession | YP_001464699 |
Protein GI | 157155112 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000109992 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGGCA TAATTATATC TGTCATCGTA TTAATTACGA TGGGCTATTT GATCCTGAAA AACTACAAAC CTCAGGTGGT GCTGGCTGCC GCAGGTATCT TCCTGATGAT GTGCGGTGTC TGGTTAGGGT TCGGTGGTGT ACTCGATCCC GCCAAAAGCA GCGGCTACTT GATCGTCGAT ATTTATAATG AAATCCTGCG CATGCTGTCC AACCGCATTG CCGGATTGGG GCTGTCGATT ATGGCGGTGG GCGGTTATGC CCGCTACATG GAGCGCACAG GAGCCAGTCG CGCGATGGTG AGCTTGCTAA GCCGCCCGTT AAAACTCATT CGCTCGCCGT ATATTATTCT ATCGGCAACT TACGTCATCG GCCAAATCAT GGCGCAGTTT ATTACCAGCG CCTCCGGCCT GGGTATGTTG CTGATGGTCA CCTTATTTCC GACGCTGGTG AGTCTGGGAG TAAGTCGCCT CTCTGCGGTG GCGGTTATCG CAACCACGAT GTCCATTGAG TGGGGGATTC TGGAAACGAA CTCCATTTTT GCAGCCCAGG TCGCGGGAAT GAAAATTGCC ACTTACTTCT TCCACTACCA GCTTCCGGTC GCCTCTTGCG TCATTATCTC GGTGGCGATC TCCCACTTTT TCGTGCAACG CGCTTTTGAC AAAAAAGATA AAAATATCAA TCACGAACAG GCAGAGCTAA AAGCTCTCGA TAATGTCCCG CCGCTCTATT ACGCCATTTT ACCTGTGATG CCGTTAATCT TGATGCTCGG CTCGCTGTTC CTCGCCCACA TCGGGCTGAT GCAGTCAGAA CTGCATCTGG TGGTGGTGAT GTTACTGAGT TTGACGGTGA CGATGTTTGT TGAGTTCTTC CGCAAGCATA ACTTGCGCGA AACAATGGAC GATGTGCAGG CGTTTTTTGA CGGCATGGGT ACGCAGTTTG CCAACGTGGT AACGCTGGTG GTCGCGGGTG AAATATTTGC GAAAGGCTTA ACGACGATTG GCACTGTTGA TGCGGTTATC AGGGGTGCGG AGCATTCTGG TCTGGGCGGT ATTGGCGTGA TGATTATTAT GGCGCTAGTC ATTGCCATTT GTGCCATTGT GATGGGCTCT GGCAATGCGC CGTTTATGTC ATTTGCCAGT CTTATTCCGA ATATCGCAGC CGGACTACAT GTACCAGCGG TTGTAATGAT TATGCCGATG CATTTTGCCA CGACGCTAGC GCGCGCGGTT TCGCCGATTA CTGCGGTGGT GAAGCGAACA GCGATCCCCA TGGCAGTCGG TTTCGTGGTG AATATGATTG CCACAATCAC GCTATTTTAT TAA
|
Protein sequence | MFGIIISVIV LITMGYLILK NYKPQVVLAA AGIFLMMCGV WLGFGGVLDP AKSSGYLIVD IYNEILRMLS NRIAGLGLSI MAVGGYARYM ERTGASRAMV SLLSRPLKLI RSPYIILSAT YVIGQIMAQF ITSASGLGML LMVTLFPTLV SLGVSRLSAV AVIATTMSIE WGILETNSIF AAQVAGMKIA TYFFHYQLPV ASCVIISVAI SHFFVQRAFD KKDKNINHEQ AELKALDNVP PLYYAILPVM PLILMLGSLF LAHIGLMQSE LHLVVVMLLS LTVTMFVEFF RKHNLRETMD DVQAFFDGMG TQFANVVTLV VAGEIFAKGL TTIGTVDAVI RGAEHSGLGG IGVMIIMALV IAICAIVMGS GNAPFMSFAS LIPNIAAGLH VPAVVMIMPM HFATTLARAV SPITAVVKRT AIPMAVGFVV NMIATITLFY
|
| |