Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2171 |
Symbol | |
ID | 6871320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 2084391 |
End bp | 2085200 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642785277 |
Product | 4-amino-4-deoxychorismate lyase |
Protein accession | YP_002215940 |
Protein GI | 198246126 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase |
TIGRFAM ID | [TIGR03461] aminodeoxychorismate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.216662 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.000000000000025967 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTTTTAA TTAATGGCCA TGCGCAGGAT CAACTTGCTG TAAGCGACCG GGCGACGCAG TTCGGCGATG GCAGTTTTAC GACCGCACGT ATTGTTGATG GCAACATTTG CCATCTGGAA GCGCATCTTC AGCGCTTGCA GGTTGCCTGT GAAAAATTAC GGATCGCTTT TAGCCATTGG GCGACCCTTC GGCAAGAGAT GACAATGTTG GCCACAGGGC ATGATTCAGG CGTGTTGAAA GTGATCATTA GCCGTGGTAG CGGTGGCCGG GGATACAGCG CCATGAATTG TCAGGCAGCT ACCCGAATCC TCTCCGTTTC TGCTTATCCC GCTTATTATT CTCAGTGGCG TAAGCAAGGC ATCACTCTTA CCCTTAGTCC CATACCGCTT GGGCGCAATC CTTATCTTGC CGGATTAAAA CATCTGAACC GCCTCGAACA GGTGTTGATT CGCTCTCATC TTGAGCAGAC GGACGCCGAT GAGGCGCTGG TTCTTGACAG CGAGGGATGG GTTACGGAAT GCTGTGCGGC TAATTTGTTC TGGCGTACAG GCGACATTGT TTTTACGCCG CGTCTGGATC AGGCCGGGGT GAACGGCATT ATGCGACAAT TTTGTTTACG CCAACTGGCG CAATCTCCTT TCCAGGTTCT TGAAGTACAG GCGAGAGAGG AGGCAGTCAG GCAGGCGGAT GAGATTATCA TTTGCAACGC GCTAATGCCG ATTATTCCCA TACGCGCCTA TCACGGGACG TCGTATTCTT CGCGAACACT GTTTCAATTT TTAGCCCCAT TTTGTGAGCA TCCGAATTAG
|
Protein sequence | MFLINGHAQD QLAVSDRATQ FGDGSFTTAR IVDGNICHLE AHLQRLQVAC EKLRIAFSHW ATLRQEMTML ATGHDSGVLK VIISRGSGGR GYSAMNCQAA TRILSVSAYP AYYSQWRKQG ITLTLSPIPL GRNPYLAGLK HLNRLEQVLI RSHLEQTDAD EALVLDSEGW VTECCAANLF WRTGDIVFTP RLDQAGVNGI MRQFCLRQLA QSPFQVLEVQ AREEAVRQAD EIIICNALMP IIPIRAYHGT SYSSRTLFQF LAPFCEHPN
|
| |