Gene SeSA_A3808 details

Gene Information

Locus tag: SeSA_A3808
Symbol:
ID: 6515601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 3672536
End bp: 3674023
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 54%
IMG OID: 642748786
Product: protein YhjJ
Protein accession: YP_002116550
Protein GI: 194734488
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
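
The gene and protein lengths listed above follow from the coordinates by simple arithmetic. The sketch below checks that, assuming the start and end positions are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein; the variable names are illustrative only and do not come from the source database.

```python
# Values copied from the record above.
start_bp = 3672536
end_bp = 3674023

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1488
print(protein_length_aa)  # 495
```

Both results agree with the Gene Length and Protein Length fields of the record.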


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGGCA CAAAAATTCG ACTCTTAGCG GGCAGTCTGT TGATGTTGGC CTCTGCCGGC 
TATGTGCAGG CAGATGCGCT CCAGCCCGAT CCGGCATGGC AACAGGGGAC GCTGGCTAAT
GGGTTACAGT GGCAAGTGTT GGCTACGCCT CAGCGCCCCA GCGATCGTAT TGAAGTTCGT
CTCCAGGTTA ATACCGGTTC GCTCACCGAA AGTACGCAAC AGAGCGGGTT CAGCCATGCG
ATTCCCCGTA TCGCGCTGAC GCAAAGCGGT GGTCTGGATG CCGCACAGGC ACGTTCTTTA
TGGCAGCAAG GGTTTGATCC GAAACGTCCC ATGCCGCCCG TTATTGTTTC TTATGATTCC
ACGCTCTATA ACCTCAGTTT ACCCAATAAC CGTAACGATC TGCTGAAAGA AGCGCTGACC
TATCTGGCTA ACGTCTCCGG TAAATTAACC ATTACGCCAG AGACGGTGAA TCATGCGTTA
AGCAGCGAAG ATATGGTTGC GACGTGGCCA GCAGATACTA AAGAGGGCTG GTGGCGTTAT
CGGCTGAAAG GATCGGCGTT ATTGGGGCAC GATCCCGCGG AACCGTTAAA GCAGCCGGTA
GACGCAGCCA AAATTCAGGC TTTCTATGAA AAATGGTACA CCCCGGATGC CATGACGCTG
ATTGTTGTCG GCAACATTGA TGCGCGCTCC GTCGCCGAGC AGATCAATAA AACGTTCGGT
ACGCTGAAAG GTAAACGCGA AACGCCCGCC CCGGTGCCGA CGCTTTCGCC GCTGCGGGCG
GAATCAGTGA GCATTATGAC CGATGCGGTG CGCCAGGATC GTCTCTCCAT TATGTGGGAT
ACGCCGTGGC AACCGATTCG CGAATCGGCG GCGCTGTTGC GCTACTGGCA GGCGGATCTG
GCGCGTGAAG CGCTGTTCTG GCATATCCAG CAAGAGCTTA CTAAAAATAA CGCGAAAGAT
ATTGGCCTGG GGTTTGACTG CCGGGTTCTG TTCCTGCGCG CGCAGTGCGC CATCAACATT
GAATCACCTA ATGATAAGCT CAATACCAAT TTGAGCCTGG TGGCGAATGA ACTGGCGAAA
GTACGCGATA AAGGTTTGTC GGAAGAGGAG TTTACTGCTC TGGTGGCGCA GAAAAATCTC
GAATTGCAAA AGCTGTTCGC GACCTACGCG CGTACCGATA CTGACATTTT GGCTGGACAG
CGTATGCGCT CGCTGCAGAA TCAGGTGGTG GATATCGCGC CGGAGCAGTA TCAGAAGTTG
CGTCAGAATT TCCTCAACAG CCTGACCGTC GATATGCTCA ATCAGAATCT ACGTCAGCAG
CTATCGCAGG AGATGGCATT GATTTTGCTG CAACCGCAAG GCGAGCCGGA ATTTAATATG
AAGGCGTTAA AGGCGACGTG GGATGAAATC ATGGTCCCGA CAACTGCCGC CGCTGTTGAA
GCAGATGAGA CGCATCCGGA AGTGACGGAT ACACCGGCGG CACAGTAA
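
The gene sequence is laid out in blocks of ten bases separated by spaces, so it has to be joined before its length or GC content can be recomputed. A minimal sketch of that step follows; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input, so the printed GC value applies to that excerpt and not to the whole gene (which the record lists as 54%).

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from the record, used only as an excerpt.
first_line = "ATGCAGGGCA CAAAAATTCG ACTCTTAGCG GGCAGTCTGT TGATGTTGGC CTCTGCCGGC"

excerpt = clean_sequence(first_line)
print(len(excerpt))  # 60
print(round(gc_percent(excerpt), 1))
```

Passing the full sequence block to the same two functions should reproduce the Gene Length and GC content fields, up to the rounding used by the database.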
 
Protein sequence
MQGTKIRLLA GSLLMLASAG YVQADALQPD PAWQQGTLAN GLQWQVLATP QRPSDRIEVR 
LQVNTGSLTE STQQSGFSHA IPRIALTQSG GLDAAQARSL WQQGFDPKRP MPPVIVSYDS
TLYNLSLPNN RNDLLKEALT YLANVSGKLT ITPETVNHAL SSEDMVATWP ADTKEGWWRY
RLKGSALLGH DPAEPLKQPV DAAKIQAFYE KWYTPDAMTL IVVGNIDARS VAEQINKTFG
TLKGKRETPA PVPTLSPLRA ESVSIMTDAV RQDRLSIMWD TPWQPIRESA ALLRYWQADL
AREALFWHIQ QELTKNNAKD IGLGFDCRVL FLRAQCAINI ESPNDKLNTN LSLVANELAK
VRDKGLSEEE FTALVAQKNL ELQKLFATYA RTDTDILAGQ RMRSLQNQVV DIAPEQYQKL
RQNFLNSLTV DMLNQNLRQQ LSQEMALILL QPQGEPEFNM KALKATWDEI MVPTTAAAVE
ADETHPEVTD TPAAQ
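
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one way to do that under stated assumptions: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it translates only the first and last lines of the gene sequence as excerpts. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same amino acid assignments
# as the standard code, so one table serves here.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(block: str) -> str:
    """Translate a DNA block in frame 0, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from the record. Every full line
# holds 60 bases, so the last line also starts in frame.
first_line = "ATGCAGGGCA CAAAAATTCG ACTCTTAGCG GGCAGTCTGT TGATGTTGGC CTCTGCCGGC"
last_line = "GCAGATGAGA CGCATCCGGA AGTGACGGAT ACACCGGCGG CACAGTAA"

print(translate(first_line))  # MQGTKIRLLAGSLLMLASAG
print(translate(last_line))   # ADETHPEVTDTPAAQ
```

The two outputs match the first 20 and last 15 residues of the protein sequence listed above.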