Gene SNSL254_A1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1852 
Symbol 
ID6484299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1818060 
End bp1819418 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content55% 
IMG OID642737226 
Productbifunctional indole-3-glycerol phosphate synthase/phosphoribosylanthranilate isomerase 
Protein accessionYP_002040978 
Protein GI194444204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0134] Indole-3-glycerol phosphate synthase
[COG0135] Phosphoribosylanthranilate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.0214876 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACCG TTTTAGCGAA AATCGTCGCA GACAAGGCGA TTTGGGTAGA AGCCCGCAAA 
CAGCAACAGC CGCTGGCCAG TTTTCAAAAT GAGATCCAGC CAAGTACACG CCATTTTTAT
GATGCGCTCC AGGGCGCGCG TACCGCCTTT ATTCTGGAGT GTAAGAAAGC ATCGCCATCA
AAAGGCGTGA TTCGCGATGA TTTCGATCCG GCGCGTATTG CCAGTATTTA TCAACATTAC
GCCTCGGCAA TCTCGGTGCT CACCGACGAA AAATATTTTC AGGGGAGCTT CGATTTTCTG
CCGGTCGTTA GCCAAAGCGC GCCGCAGCCG ATTCTGTGTA AGGATTTTAT TATCGATCCC
TATCAGATCT ACCTTGCCCG TTACTATCAG GCCGATGCCT GTTTACTGAT GCTCTCGGTT
CTGGATGACG AACAGTATCG CCAGCTCTCC GCCGTCGCGC ACAGTCTGAA AATGGGCGTG
CTCACGGAGG TCAGTAATGA CGAAGAACGG GAGCGCGCGA TAGCGTTAGG CGCAAAAGTG
GTAGGTATCA ACAATCGCGA TCTGCGCGAT CTGTCGATTG ATTTGAATCG CACCCGCCAG
CTGGCGCCAA AACTGGGCCA CGGCGTGACT GTCATCAGCG AGTCCGGGAT TAACACCTAT
GGTCAGGTAC GCGAACTGAG CCACTTCGCC AACGGTTTTT TAATTGGCTC GGCGTTAATG
GCGCATGACG ATCTTAACGC CGCCGTCCGT CGCGTGCTGC TTGGCGAAAA TAAAGTCTGC
GGCCTGACCC GCGTCCAGGA CGCTAAAGCG GCCTGTGACG CTGGCGCAAT ATATGGCGGG
TTGATTTTTG TGCCCTCATC TCCACGCGCG GTGAGCGTTG AGCAGGCGCG AGAAGTGATA
AGCGGCGCGC CATTGCAGTA TGTCGGCGTT TTCCAGAACG CTGATATCGC CGATGTTTGC
CAGAAAGCCG CCGTCCTGTC GCTTTCTGCC GTACAGCTAC ATGGCAGCGA AGACCAGGCG
TATGTCAACG CGCTGCGCGA GGCGTTGCCG AAACAGGTGC AAATCTGGAA GGCGCTGAGC
GTTAGCGATG CCCTTCCCGC ACGCGATTAT CACCATGTCG ATAAATACGT TTTCGACAAT
GGGCAAGGCG GCAGCGGGCA GCGCTTCGAC TGGTCACTGC TACAGGGGCA ACCACTGGAT
AATGTGTTAC TGGCGGGCGG GCTGGCGGCC GATAACTGCG TCCAGGCGGC GCAAGTCGGC
TGTGCCGGTC TCGATTTTAA TTCAGGTGTG GAGTCACAGC CGGGCATCAA AGATGCTCGT
CTTCTGGCCT CGGTTTTTCA GACACTGCGC GCATATTAA
 
Protein sequence
MQTVLAKIVA DKAIWVEARK QQQPLASFQN EIQPSTRHFY DALQGARTAF ILECKKASPS 
KGVIRDDFDP ARIASIYQHY ASAISVLTDE KYFQGSFDFL PVVSQSAPQP ILCKDFIIDP
YQIYLARYYQ ADACLLMLSV LDDEQYRQLS AVAHSLKMGV LTEVSNDEER ERAIALGAKV
VGINNRDLRD LSIDLNRTRQ LAPKLGHGVT VISESGINTY GQVRELSHFA NGFLIGSALM
AHDDLNAAVR RVLLGENKVC GLTRVQDAKA ACDAGAIYGG LIFVPSSPRA VSVEQAREVI
SGAPLQYVGV FQNADIADVC QKAAVLSLSA VQLHGSEDQA YVNALREALP KQVQIWKALS
VSDALPARDY HHVDKYVFDN GQGGSGQRFD WSLLQGQPLD NVLLAGGLAA DNCVQAAQVG
CAGLDFNSGV ESQPGIKDAR LLASVFQTLR AY