Gene SeAg_B0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B0041 
Symbol 
ID6794279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp41723 
End bp42910 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content50% 
IMG OID642774353 
Productchondroitin sulfate/heparin utilization regulation protein 
Protein accessionYP_002145017 
Protein GI197247602 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGGGA AAAGTTGTCA GGTCATGGTT AAACCAACCG GATCGGTGTG TAACCTTGAC 
TGTAAGTACT GTTTTTATCT GGAGAAAGAA AAGCTCTATC CGGATCGAAA AAACCATTAC
AAAATGTCGG AAGAGACCCT CGAACTCTTC ATCAGGCAGC AGATTGCCGC ACAGGATATT
GATGAGGTCA TTTTTGCGTG GCAGGGCGGG GAACCCACAT TAATGGGCAT CCCGTTTTAT
CGTAAAGCCG TTGAATTTCA GCAGCGCTAT TGTGGCGGCA AAACCATCGT CAATACCTTC
CAGACCAACG GCATCCTGAT CAACGACGAC TGGGCGACCT TCTTCCGGGA GCATGATTTT
CTGGTTGGCG TCTCTATTGA TGGCGATGCC GCGTTACACG ATGAATGGCG AGTGACGCGC
TCCGGAAAGC CGACGCATGA AAAAGTAGAA AATGCGGTGC GTTGTCTGGC GCAGCACAAC
GTGGAATTTA ATACCCTCAC GGTGGTTAAC CGTACCAATA TGCACCATCC TGTTCAGGTC
TATCGCTACC TGAAAAGCAT TGGTAGCCGC TATATGCAAT TTATCCCTTT AGTTGAACGA
TGCGGGGAAA ATGGGCTGGC GCAGCCGCAG GATAAACATA TCGCGATGAC GCCGTGGTCG
GTCGATAGCC TGCAATTTGG TCAGTTTCTG AATGCGGTAT TTGATGTCTG GATCCGTGAG
GATATCGGCG ATATCGGCAT CCAGCTATTT GAACAGACGC TGGCAGCCTG GTGCGGCCTG
CCGCCGCAGG TTTGCGTTTT TGCTCCCACC TGCGGCAGCG CGTTTGCGAT GGAAATGAAC
GGCGATGTTT ATAACTGCGA TCACTTCGTA TATCCGCAAT TTAAACTGGG GAATATCCAC
CAGAAGACGC TGCGTCAAAT GAATCAGGGC GAACAAAATC GCCAGTTCGG CAGCGATAAA
CAGCATTCAA TGGCGCAGGA GTGCCATCGC TGTCAATGGA AGTTCGCCTG CTATGGCGGC
TGTCCGAAAC ATCGTTTTTT ACCCTCCGCG TCAGGCGCAA CCAATCATAA CTATCTGTGT
GCAGGTTATC AGGCTTTTTT CTCGCATACC GCGACGGCGA TGAGTGCCAT GCGAACCCTG
TATGAAAAAG GCATCTCACC TGCAGAAATA AAGTCAATAT TTGTTTGA
 
Protein sequence
MFGKSCQVMV KPTGSVCNLD CKYCFYLEKE KLYPDRKNHY KMSEETLELF IRQQIAAQDI 
DEVIFAWQGG EPTLMGIPFY RKAVEFQQRY CGGKTIVNTF QTNGILINDD WATFFREHDF
LVGVSIDGDA ALHDEWRVTR SGKPTHEKVE NAVRCLAQHN VEFNTLTVVN RTNMHHPVQV
YRYLKSIGSR YMQFIPLVER CGENGLAQPQ DKHIAMTPWS VDSLQFGQFL NAVFDVWIRE
DIGDIGIQLF EQTLAAWCGL PPQVCVFAPT CGSAFAMEMN GDVYNCDHFV YPQFKLGNIH
QKTLRQMNQG EQNRQFGSDK QHSMAQECHR CQWKFACYGG CPKHRFLPSA SGATNHNYLC
AGYQAFFSHT ATAMSAMRTL YEKGISPAEI KSIFV