Gene SeSA_A2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2022 
Symbol 
ID6517100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1944300 
End bp1945406 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content51% 
IMG OID642747105 
Productexodeoxyribonuclease 8 
Protein accessionYP_002114906 
Protein GI194734908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGATA ACACCCCAAC AACCCATGAA GAAAATGCAG CAAGGCTTCG TCAGGCGGGC 
AAATGTCTGC GGGATATTGA GGCAGGGAGA TTTCAGTGTG ATGAAGAAAA ACCGCAACCG
ACAGGCGAAC TGGCAGATGA ACCAGCAACG CCTGAAGCAG TGGAACAGGA CACAACTGAA
CATCATCCGG ATCCACAGCC GCTGGAGAAT GAGCCACCTG TAAGCCAGAC AGAAGCAGGC
TACCAGAAAA TACGGGCAGA ACTGCACGAA GCACGTAAAA ACATTCCACC CAAAAACCCG
GTTGATGTTG GTAAACAACT GGCAGCCGTG CGCGGTGAAT ATGCCGAAGA CATCAGCGAC
CCGAACGATC CGAAGTGGGT TCCTAACAAT TACAGCGCCT CAAATCAGGG TGAAAAAGAA
GAAGTGGTGC CGGAGGAAAA ACAACTAGCA GCAGAGCCGG AGGCTGTCAC CAGAAACGCG
GACGGGACTT TCGATGTATC AGCGCTATTT CCGCCCCCCT CAAACCAGAC CGAAAAAACG
GAAGCCAGAA CAGAAAGAGA TGGAGAAACG CCGAAAGAGA GCAACCAGCA GGAAACGGCT
GGCGATACAG GGCAGGAAAT TACAACGGAC GGTGGATCAG GTACTGGCGG TGATGAAGCT
GGCGAAGCGG CAGATCCCGT AGAAAACGGA AATTTCACTG TCCCTGATGA TATACAGCCA
GGCATTTACT ATGACATCCC TAACGAGGCG TATCACGCTG GCCCGGGGGT CAGTAAATCA
CAGCTTGATG ATATCGCAGA TACACCAGCA ATTTATCTTT GGCGCAAAAA TGCCCCCGTG
GACACGGAGA AAACAAAATC TCTCGATACA GGAACGGCTT TTCACTGCCG GGTACTGGAA
CCAGAGGAAT TCAGTAAACG CTTCATCATC GCACCGGAGT TTAACCGCCG TACCAGTGCA
GGAAAAGAAG AAGAGAAAAC CTTTCTGGAA GAGTGCGCAC GGACAGGAAT AACCGTGCTT
ACGGCAGAAG AAGGCCGGAA AATCGAACTT ATGTACCAGA GTGTGATGGC GTTAACCGAG
TGCATTGCTG GAGAAGTTGA TCAGTGA
 
Protein sequence
MPDNTPTTHE ENAARLRQAG KCLRDIEAGR FQCDEEKPQP TGELADEPAT PEAVEQDTTE 
HHPDPQPLEN EPPVSQTEAG YQKIRAELHE ARKNIPPKNP VDVGKQLAAV RGEYAEDISD
PNDPKWVPNN YSASNQGEKE EVVPEEKQLA AEPEAVTRNA DGTFDVSALF PPPSNQTEKT
EARTERDGET PKESNQQETA GDTGQEITTD GGSGTGGDEA GEAADPVENG NFTVPDDIQP
GIYYDIPNEA YHAGPGVSKS QLDDIADTPA IYLWRKNAPV DTEKTKSLDT GTAFHCRVLE
PEEFSKRFII APEFNRRTSA GKEEEKTFLE ECARTGITVL TAEEGRKIEL MYQSVMALTE
CIAGEVDQ