Gene Information |
Locus tag | EcSMS35_2408 |
Symbol | arnC |
ID | 6147504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2456450 |
End bp | 2457418 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617281 |
Product | undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase |
Protein accession | YP_001744453 |
Protein GI | 170683519 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAAA TCCACCCTGT TAAGAAAGTC TCGGTGGTTA TTCCCGTTTA TAACGAGCAG GAAAGCTTAC CGGAATTAAT CAGGCGCACC ACAAAAGCCT GTGAATCACT GGGGAAAGAG TATGAGATCC TGCTGATTGA TGACGGCAGT AGCGATAATT CCGCGCATAT GCTGGTCGAA GCCTCACAAG CGGAGGGCAG CCATATTGTG TCTATTTTGC TTAACCGCAA TTACGGGCAA CATTCAGCGA TTATGGCGGG ATTCAGTCAC GTCACCGGCG ACTTAATTAT TACCCTCGAT GCCGATCTCC AGAATCCGCC AGAGGAAATC CCCCGTCTGG TGGCAAAAGC CGATGAAGGT TACGACGTGG TAGGGACTGT ACGCCAGAAC CGCCAGGACA GCTGGTTTCG TAAAACCGCT TCGAAGATGA TTAACCGGCT TATTCAGCGC ACCACTGGCA AAGCGATGGG TGACTATGGT TGTATGCTGC GCGCCTATCG CCGTCATATT GTCGATGCGA TGTTGCACTG CCATGAACGC AGCACCTTTA TCCCGATTCT GGCGAATATC TTCGCCCGCC GTGCCATTGA AATTCCAGTG CATCATGCCG AGCGTGAGTT TGGTGAATCC AAATACAGCT TTATGCGCCT GATTAATTTG ATGTACGACC TGGTGACCTG CCTTACCACT ACGCCGCTAC GTATGCTGAG TCTGCTCGGC AGCATTATTG CGATTGGCGG TTTTAGCATT GCGGTGCTGT TGGTGATTTT ACGCCTGACC TTCGGACCCC AATGGGCGGC AGAAGGCGTC TTTATGCTAT TTGCCGTGCT GTTTACTTTT ATTGGCGCTC AGTTTATCGG CATGGGATTG CTCGGTGAAT ATATCGGCAG GATCTACACC GATGTCCGCG CCCGCCCCCG CTATTTTGTT CAGCAAGTTA TCCGTCCATC CAGCAAGGAA AATGAATAA
|
Protein sequence | MFEIHPVKKV SVVIPVYNEQ ESLPELIRRT TKACESLGKE YEILLIDDGS SDNSAHMLVE ASQAEGSHIV SILLNRNYGQ HSAIMAGFSH VTGDLIITLD ADLQNPPEEI PRLVAKADEG YDVVGTVRQN RQDSWFRKTA SKMINRLIQR TTGKAMGDYG CMLRAYRRHI VDAMLHCHER STFIPILANI FARRAIEIPV HHAEREFGES KYSFMRLINL MYDLVTCLTT TPLRMLSLLG SIIAIGGFSI AVLLVILRLT FGPQWAAEGV FMLFAVLFTF IGAQFIGMGL LGEYIGRIYT DVRARPRYFV QQVIRPSSKE NE
|
| |
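The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS (TAA in the gene sequence above) is a stop codon that is not counted in the protein length. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration and are not part of the database.

```python
# Coordinates copied from the record above.
START_BP = 2456450
END_BP = 2457418


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string.

    Characters other than A, C, G, T (such as the spaces that group the
    sequence into blocks of ten) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Prints "969 322", matching the Gene Length and Protein Length fields.
    print(length, protein_length(length))
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the GC content field; the rounding rule used by the database is not stated in the record, so small differences are possible.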