Gene SeD_A1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A1603 
Symbol 
ID6872020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1545538 
End bp1546896 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content55% 
IMG OID642784749 
Productbifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase 
Protein accessionYP_002215417 
Protein GI198243276 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0134] Indole-3-glycerol phosphate synthase
[COG0135] Phosphoribosylanthranilate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.913758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.0208372 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA 
CAGCAACAGC CGCTGGCCAG CTTTCAAAAT GAGATCCAGC CAAGTACACG CCATTTTTAT
GATGCGCTCC AGGGCGCGCG TACCGCCTTT ATTCTGGAGT GTAAGAAAGC ATCGCCATCA
AAAGGCGTGA TTCGCGATGA TTTCGATCCG GCGCGTATTG CCAATATTTA TCAACATTAC
GCCTCGGCAA TCTCGGTGCT CACCGACGAA AAATATTTTC AGGGTAGCTT CGATTTTCTG
CCGGTCGTTA GCCAAAGCGC ACCGCAGCCG ATTCTGTGTA AGGATTTTAT TATCGATCCC
TATCAGATCT ACCTTGCCCG TTACTATCAG GCCGATGCCT GTTTACTGAT GCTCTCGGTT
CTGGATGACG AACAGTATCG CCAACTCTCC GCCGTCGCGC ACAGTCTGAA AATGGGCGTG
CTCACGGAAG TCAGTAATGA CGAAGAACGG GAGCGCGCGA TAGCGTTAGG CGCAAAAGTG
GTAGGTATCA ACAATCGCGA TCTGCGCGAT CTGTCGATTG ATTTGAATCG CACCCGCCAG
CTGGCGCCAA AACTGGGCCA CGGCGTGACT GTCATCAGCG AGTCCGGGAT TAACACCTAT
GGTCAGGTAC GCGAACTGAG CCACTTCGCC AACGGTTTTT TAATTGGCTC GGCGTTAATG
GCGCATGACG ATCTTAACGC CGCCGTCCGT CGCGTGCTGC TTGGCGAAAA TAAAGTCTGC
GGCCTGACCC GCGCCCAGGA CGCTAAAGCG GCCTGTGACG CTGGCGCAAT ATATGGCGGG
TTGATTTTTG TGCCCTCATC TCCACGCGCG GTGAGCGTTG AGCAGGCGCG AGAAGTGATA
AGCGGCGCGC CATTGCAGTA TGTCGGCGTT TTCCAGAACG CTGATATCGC CGATGTTTGC
CAGAAAGCCG CCGTCCTGTC GCTTTCTGCC GTACAGCTAC ATGGCAGCGA AGACCAGGCG
TATGTCAACG CGCTGCGCGA GGCGTTGCCG CGCAATGTGC AAATCTGGAA GGCGCTGAGC
GTTAGCAATG CCCTTCCCGC ACGCGATTAT CACCATGTCG ATAAATACAT TTTCGACAAT
GGGCAAGGCG GCAGCGGGCA GCGCTTCGAC TGGTCACTGC TACAGGGGCA ACCGCTGGAT
GATGTGTTAC TGGCGGGCGG GCTGGCGGCC GATAACTGCG TCCAGGCGGC GCAAGTCGGC
TGTGCCGGTC TCGATTTTAA TTCAGGTGTG GAGTCACAGC CGGGCATCAA AGATGCTCGT
CTTCTGGCCT CGGTTTTTCA GACACTGCGC GCATATTAA
 
Protein sequence
MQTVLAKIVA DKAIWVEARK QQQPLASFQN EIQPSTRHFY DALQGARTAF ILECKKASPS 
KGVIRDDFDP ARIANIYQHY ASAISVLTDE KYFQGSFDFL PVVSQSAPQP ILCKDFIIDP
YQIYLARYYQ ADACLLMLSV LDDEQYRQLS AVAHSLKMGV LTEVSNDEER ERAIALGAKV
VGINNRDLRD LSIDLNRTRQ LAPKLGHGVT VISESGINTY GQVRELSHFA NGFLIGSALM
AHDDLNAAVR RVLLGENKVC GLTRAQDAKA ACDAGAIYGG LIFVPSSPRA VSVEQAREVI
SGAPLQYVGV FQNADIADVC QKAAVLSLSA VQLHGSEDQA YVNALREALP RNVQIWKALS
VSNALPARDY HHVDKYIFDN GQGGSGQRFD WSLLQGQPLD DVLLAGGLAA DNCVQAAQVG
CAGLDFNSGV ESQPGIKDAR LLASVFQTLR AY