Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_0479 |
Symbol | |
ID | 6068481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 519461 |
End bp | 520828 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641599884 |
Product | putative cryptic C4-dicarboxylate transporter DcuD |
Protein accession | YP_001723483 |
Protein GI | 170018529 |
COG category | [C] Energy production and conversion |
COG ID | [COG3069] C4-dicarboxylate transporter |
TIGRFAM ID | [TIGR00771] c4-dicarboxylate anaerobic carrier family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00235153 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.454228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGGCA TAATTATATC TGTCATCGTA TTAATTACGA TGGGCTATTT GATCCTGAAA AACTACAAAC CTCAGGTGGT GCTGGCTGCC GCAGGTATCT TCCTGATGAT GTGCGGTGTC TGGTTAGGGT TCGGTGGTGT ACTCGATCCC ACCAAAAGCA GCGGCTACTT GATCGTCGAT ATTTATAATG AAATCCTGCG CATGCTGTCC AACCGCATTG CCGGATTGGG GCTGTCGATT ATGGCGGTGG GCGGTTATGC CCGCTACATG GAGCGCATAG GGGCCAGTCG CGCGATGGTG AGCTTGTTAA GCCGCCCGTT AAAACTCATT CGCTCGCCGT ATATTATTCT GTCGGCAACT TACGTCATCG GCCAAATCAT GGCGCAGTTT ATTACCAGCG CCTCCGGTCT GGGTATGTTG CTGATGGTCA CCTTATTTCC GACGCTGGTG AGTCTGGGAG TAAGTCGCCT CTCTGCGGTG GCGGTTATCG CAACCACAAT GTCCATTGAG TGGGGAATTC TGGAAACGAA CTCCATTTTT GCTGCCCAGG TAGCGGGAAT GAAAATTGCC ACATACTTCT TCCACTACCA GCTTCCGGTC GCCTCTTGCG TCATTATCTC GGTGGCGATC TCCCACTTTT TCGTGCAACG CGCTTTTGAC AAAAAAGATA AAAATATCAA TCACGAACAG GCAGAGCAAA AAGCTCTCGA TAATGTCCCG CCGCTCTATT ACGCCATTTT ACCTGTGATG CCGTTAATCC TGATGCTCGG CTCGCTGTTC CTCGCCCACG TCGGGCTGAT GCAGTCAGAA CTGCATCTGG TGGTGGTGAT GTTACTGAGT TTGACGGTGA CGATGTTTGT TGAGTTCTTC CGCAAGCATA ACTTGCGCGA AACAATGGAC GATGTGCAGG CGTTTTTTGA CGGCATGGGT ACGCAGTTTG CCAACGTGGT AACGCTGGTG GTCGCGGGTG AAATATTTGC GAAAGGCTTA ACGACGATTG GCACTGTCGA TGCGGTTATC AGGGGTGCGG AGCATTCTGG TCTGGGCGGT ATTGGCGTGA TGATTATTAT GGCGCTGGTC ATTGCCATTT GTGCCATTGT GATGGGCTCT GGCAATGCGC CGTTTATGTC ATTTGCCAGT CTTATTCCGA ATATCGCAGC CGGACTACAT GTACCAGCGG TTGTAATGAT TATGCCGATG CATTTTGCCA CGACGCTAGC GCGCGCGGTT TCGCCGATTA CTGCGGTGGT GGTCGTTACG TCAGGAATTG CAGGCGTTTC GCCTTTTGCG GTGGTGAAGC GAACAGCGAT CCCCATGGCA GTCGGTTTCG TGGTGAATAT GATTGCCACA ATCACGCTAT TTTATTAA
|
Protein sequence | MFGIIISVIV LITMGYLILK NYKPQVVLAA AGIFLMMCGV WLGFGGVLDP TKSSGYLIVD IYNEILRMLS NRIAGLGLSI MAVGGYARYM ERIGASRAMV SLLSRPLKLI RSPYIILSAT YVIGQIMAQF ITSASGLGML LMVTLFPTLV SLGVSRLSAV AVIATTMSIE WGILETNSIF AAQVAGMKIA TYFFHYQLPV ASCVIISVAI SHFFVQRAFD KKDKNINHEQ AEQKALDNVP PLYYAILPVM PLILMLGSLF LAHVGLMQSE LHLVVVMLLS LTVTMFVEFF RKHNLRETMD DVQAFFDGMG TQFANVVTLV VAGEIFAKGL TTIGTVDAVI RGAEHSGLGG IGVMIIMALV IAICAIVMGS GNAPFMSFAS LIPNIAAGLH VPAVVMIMPM HFATTLARAV SPITAVVVVT SGIAGVSPFA VVKRTAIPMA VGFVVNMIAT ITLFY
|
| |