Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A1604 |
Symbol | trpD |
ID | 6872590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 1546900 |
End bp | 1548495 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642784750 |
Product | bifunctional glutamine amidotransferase/anthranilate phosphoribosyltransferase |
Protein accession | YP_002215418 |
Protein GI | 198244045 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0512] Anthranilate/para-aminobenzoate synthases component II [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01245] anthranilate phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.262187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.00317716 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGATA TTCTGCTGCT CGATAACATC GACTCGTTTA CCTGGAACCT GGCAGATCAG CTACGGACCA ACGGTCATAA CGTGGTGATT TACCGTAACC ATATTCCGGC GCAGACGCTT ATCGATCGCC TGGCAACAAT GAAAAATCCG GTGCTAATGC TCTCCCCCGG TCCGGGCGTT CCCAGCGAGG CAGGCTGTAT GCCGGAGCTG CTGACCCGAC TACGCGGCAA GTTACCGATC ATCGGCATTT GTCTGGGGCA TCAGGCGATT GTCGAAGCTT ACGGCGGTTA TGTCGGTCAG GCGGGAGAAA TCCTGCATGG CAAAGCCTCC AGCATTGAGC ATGACGGTCA GGCGATGTTC GCCGGGCTGG CGAATCCGCT ACCGGTCGCG CGTTATCATT CGCTGGTCGG CAGTAATGTT CCTGCCGGGC TAACCATTAA CGCCCATTTC AACGGCATGG TGATGGCGGT ACGTCATGAT GCGGATCGCG TTTGCGGTTT TCAATTTCAT CCTGAGTCCA TCCTGACGAC ACAGGGTGCG CGTCTACTGG AGCAAACATT AGCCTGGGCG CAGCAAAAGC TGGAACCGAC CAACACCCTA CAGCCAATTC TGGAGAAACT CTATCAGGCG CAAACTCTGA CGCAACAAGA GAGCCATCAG TTGTTTTCGG CGGTCGTCCG CGGCGAACTT AAACCTGAAC AGCTCGCCGC CGCGCTGGTG AGCATGAAAA TTCGCGGCGA GCATCCCAAT GAAATTGCCG GCGCCGCCAC TGCGTTGCTG GAAAATGCCG CCCCGTTCCC GCGCCCGGAC TACCTGTTTG CGGATATCGT TGGTACCGGC GGCGATGGCA GTAACAGTAT CAATATCTCT ACCGCCAGCG CCTTTGTCGC AGCGGCCTGT GGACTGAAAG TGGCGAAACA CGGCAACCGT AGCGTGTCCA GCAAATCGGG GTCATCCGAT CTGCTGGCGG CGTTCGGTAT TAATCTGGAT ATGAACGCCG ATAAATCACG TCAGGCGTTA GATGAACTGG GCGTCTGTTT CCTGTTCGCG CCGAAATATC ACGCCGGATT CCGTCACGCG ATGCCGGTTC GCCAACAGTT AAAAACGCGA ACCCTGTTCA ACGTACTCGG CCCGCTGATC AACCCGGCGC ATCCGCCGCT GGCATTGATT GGCGTCTATA GCCCGGAACT GGTGCTGCCG ATTGCGGAAA CCTTACGGGT ACTGGGCTAT CAGCGCGCGG CGGTGGTACA TAGCGGCGGC ATGGATGAGG TTTCGCTCCA TGCGCCGACG ATTGTCGCGG AGCTGCATGA CGGCGAAATT AAAAGCTATC AACTTACGGC GGAGGATTTT GGCCTGACGC CTTACCATCA GGATCAGTTG GCTGGCGGCA CGCCGGAAGA AAACCGTGAC ATTCTGACGC GGTTATTACA AGGTAAAGGC GATGCCGCGC ATGAGGCCGC CGTCGCCGCC AACGTGGCGA TGCTGATGCG TCTGCATGGT CAGGAAGATC TCAAAGCCAA CGCGCAAACC GTGCTTGATG TTCTGCGCAA CGGCACCGCA TATGACAGAG TCACCGCACT GGCGGCAAGA GGGTAA
|
Protein sequence | MADILLLDNI DSFTWNLADQ LRTNGHNVVI YRNHIPAQTL IDRLATMKNP VLMLSPGPGV PSEAGCMPEL LTRLRGKLPI IGICLGHQAI VEAYGGYVGQ AGEILHGKAS SIEHDGQAMF AGLANPLPVA RYHSLVGSNV PAGLTINAHF NGMVMAVRHD ADRVCGFQFH PESILTTQGA RLLEQTLAWA QQKLEPTNTL QPILEKLYQA QTLTQQESHQ LFSAVVRGEL KPEQLAAALV SMKIRGEHPN EIAGAATALL ENAAPFPRPD YLFADIVGTG GDGSNSINIS TASAFVAAAC GLKVAKHGNR SVSSKSGSSD LLAAFGINLD MNADKSRQAL DELGVCFLFA PKYHAGFRHA MPVRQQLKTR TLFNVLGPLI NPAHPPLALI GVYSPELVLP IAETLRVLGY QRAAVVHSGG MDEVSLHAPT IVAELHDGEI KSYQLTAEDF GLTPYHQDQL AGGTPEENRD ILTRLLQGKG DAAHEAAVAA NVAMLMRLHG QEDLKANAQT VLDVLRNGTA YDRVTALAAR G
|
| |