Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3395 |
Symbol | arnC |
ID | 6969009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3138946 |
End bp | 3139914 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643387203 |
Product | undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase |
Protein accession | YP_002271666 |
Protein GI | 209398426 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAAA TCCACCCTGT TAAGAAAGTC TCGGTGGTTA TTCCCGTTTA TAACGAGCAG GAAAGCTTAC CGGAATTAAT CAGGCGCACC ACCACAGCCT GTGAATCGTT GGGGAAAGAG TATGAGATCC TGCTGATTGA TGACGGCAGT AGCGATAATT CCGCGCATAT ACTGGTCGAA GCCTCACAAG CGGAGAACAG CCATATTGTA TCTATTTTGC TTAACCGCAA TTACGGGCAA CATTCAGCGA TTATGGCGGG ATTCAGTCAC GTTACTGGCG ACTTAATTAT TACCCTTGAT GCCGATCTCC AGAATCCGCC AGAAGAAATC CCCCGCCTGG TGGCAAAAGC CGATGAAGGT TACGACGTGG TAGGGACTGT ACGCCAGAAC CGCCAGGACA GCTGGTTTCG TAAAACAGCT TCGAAGATGA TTAACCGGCT TATTCAGCGC ACCACCGGCA AAGCGATGGG TGACTACGGT TGTATGCTGC GCGCCTATCG CCGTCATATT GTCGATGCGA TGTTGCACTG CCATGAACGC AGCACCTTTA TCCCGATTCT GGCGAATATC TTCGCCCGCC GTGCCATTGA AATTCCAGTG CATCATGCCG AGCGTGAGTA TGGTGAATCC AAATACAGCT TTATGCGCCT GATTAATTTG ATGTACGACC TGGTGACCTG CCTTACCACA ACGCCGCTAC GTATGCTGAG CCTGCTCGGC AGCATTATTG CGATTGGAGG TTTTAGCATT GCGGTGCTGC TGGTGATTTT ACGCCTGACC TTCGGACCAC AATGGGCGGC AGAAGGCGTC TTTATGCTAT TTGCCGTGCT GTTTACTTTT ATTGGCGCTC AGTTTATCGG CATGGGATTA CTCGGTGAAT ATATCGGCAG GATCTACACC GATGTCCGCG CCCGCCCCCG CTATTTTGTT CAGCAAGTTA TCCGTCCATC CAGCAAGGAA AATGAATAA
|
Protein sequence | MFEIHPVKKV SVVIPVYNEQ ESLPELIRRT TTACESLGKE YEILLIDDGS SDNSAHILVE ASQAENSHIV SILLNRNYGQ HSAIMAGFSH VTGDLIITLD ADLQNPPEEI PRLVAKADEG YDVVGTVRQN RQDSWFRKTA SKMINRLIQR TTGKAMGDYG CMLRAYRRHI VDAMLHCHER STFIPILANI FARRAIEIPV HHAEREYGES KYSFMRLINL MYDLVTCLTT TPLRMLSLLG SIIAIGGFSI AVLLVILRLT FGPQWAAEGV FMLFAVLFTF IGAQFIGMGL LGEYIGRIYT DVRARPRYFV QQVIRPSSKE NE
|
| |