Gene SeAg_B4197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4197 
Symbol 
ID6794550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4092154 
End bp4093449 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content54% 
IMG OID642778308 
Productchondroitin sulfate/heparin utilization regulation protein 
Protein accessionYP_002148892 
Protein GI197251257 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTAG CAGGTTGTCA CGTAATGGCC AAACCCGGCG GCGCGATCTG CAATATTGAT 
TGCACATATT GCTTCTATCT TGAAAAAGAG GCGCTGTACC CGGAACGCAA TAAAAACTGG
CGGATGTCGG ACGAGACGCT GGAACAATTT ATACGCCAGC ATATTGCCGC GCAGAGTGGC
GACCGCATTG ACTTTGCCTG GCAGGGCGGC GAACCGACCA TGATGGGACT ACCGTTTTTC
CGTCGGGTTG TCGCATTATG TGAAAAGTAC GGCGATGGGC GAAAAATCAC TCATGCGTTG
CAGACGAACG GCATCCTGGT GAATGACGAG TGGGCGCGCT TTTTCGCTGA ACAGCATTTT
CTCATCGGTC TCTCTATCGA CGGTCCGGCG TCGTTACACA ACCACTATCG GCTTAATCGC
GCTGGAAAAG GAACTCATGA ACAGGTCGTC GCAGCCATGG CGCGGCTTAA AGCGCACCAT
GTCGACTTTA ATACCTTAAC CGTCGTGGGA AAACATAACG TCGGTCATGC AGCAGACGTC
TACGAATTTC TTCTGGCGGC GGGATCGCGT TTTATTCAGT TTATCCCGCT GGTAGAGCGA
ATGAGCACCG ATAACTCATC GGTACTTAAT CTGGTGATGC CCGGCGAAAG CGCGGCAAAG
CTGGCGCCAT GGACGGTACC GTCGTGGCAA TATGGCGAAT TTCTCAACCA GATCTTTGAT
ATCTGGGTTC GTCGCGACGT AGACCGCGTC TATGTGCAGA TGTTTGACGT GGCGTTAGCC
GCCTGGACGG CGCAGCAGCC GGTACTGTGT GTACATTCCG AGACTTGTGG ACATGCCTTC
GCGTTGGAGT CGAACGGCGA TCTCTACAAC TGCGACCACT TTGTCTACCC GGAACATCTG
CTGGGGAATA TCCACCAGCA CAGCATCAAA ACCTTAAATA ATAGCGAGCG GGCTATTGTG
TTTGGCGAGG CCAAGCGGGA GACCCTGACC GCCGATTGTC GTCGCTGTGA CTACCGCTTT
GCGTGTCATG GCGGCTGTCC GAAGCATCGC TTTGCCGTCT CGCCGTCCGG TCATCCTGCG
CATAATTACT TGTGTGCGGG CTATAAGCAT TTTTTCCAGC ACGTTACGCC GTATATGAAT
GTCTGGCGGG AGCTGCTGGC GCAAGGCTAT CCGATGGCAT CGATCATGCG CTGGCTGGCG
CAGGACGCGC GTAAAGACAC AGGAGCCGTC AGTCGTAACC ATCTCTGTCC CTGCGGCAGC
GGCAAAAAAT ATAAAAAATG CTGTGGTAAA GCATAG
 
Protein sequence
MAVAGCHVMA KPGGAICNID CTYCFYLEKE ALYPERNKNW RMSDETLEQF IRQHIAAQSG 
DRIDFAWQGG EPTMMGLPFF RRVVALCEKY GDGRKITHAL QTNGILVNDE WARFFAEQHF
LIGLSIDGPA SLHNHYRLNR AGKGTHEQVV AAMARLKAHH VDFNTLTVVG KHNVGHAADV
YEFLLAAGSR FIQFIPLVER MSTDNSSVLN LVMPGESAAK LAPWTVPSWQ YGEFLNQIFD
IWVRRDVDRV YVQMFDVALA AWTAQQPVLC VHSETCGHAF ALESNGDLYN CDHFVYPEHL
LGNIHQHSIK TLNNSERAIV FGEAKRETLT ADCRRCDYRF ACHGGCPKHR FAVSPSGHPA
HNYLCAGYKH FFQHVTPYMN VWRELLAQGY PMASIMRWLA QDARKDTGAV SRNHLCPCGS
GKKYKKCCGK A