Gene SeSA_A1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1100 
Symbol 
ID6518965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1081708 
End bp1082940 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content55% 
IMG OID642746228 
Producthypothetical protein 
Protein accessionYP_002114038 
Protein GI194736176 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTGC CGTACCTTTC TCTTTCCCAG GCCCGGTGTC TTCACCTTGC TGCGCAGGGG 
CTATTGAAAA AGCCGCGCCG TAACGCGATG CCTGGCGATG TTCTTGCCGC CATCTCACGC
ATGGCGTTGT TGCAAATTGA TACCATCAAT GTTGTCGCAC GTAGCCCCTA TCTGGTGCTG
TTTAGCCGTC TCGGTTCGTA CCCGCAGGCC TGGCTGGATG AGGCGCTGCG ACGCGGCGAG
TTAATCGAAT ACTGGGCGCA TGAGGCCTGT TTCTTACCAC GCCGCGACTT TAAACTTATC
CGCCATCGTA TGCTGTCGCC GGAAAAGATG GGCTGGAAAT ATCGCGCGGC ATGGATGCAT
GAGCACGCGG AAGAAATAGA ACAGCTAGTG CGGCATATTC AGGAGCACGG TCCGGTGCGT
TCTGCCGATT TTGAGCATGT GCAGAAAGGC GTCAGCGGCT GGTGGGAATG GAAACCACAT
AAACGCCACC TTGAGGGTTT ATTTACCGCC GGAAAAGTCA TGGTTGTTGA GCGGCGTAAT
TTTCAACGTG TATATGATTT AACGCCCCGT GTGATGCCGC ACTGGGATGA TGGACGCGAT
GGACTGTCAC AGTCGCAGGC GGAAAGCCTG ATGCTGGATA ATAGCGCGCG CAGTCTGGGG
ATTTTCCGTG AACAGTGGCT GGCGGATTAC TACCGCCTGA AACGTCCTGA CCTGAAGGGA
TGGCGGGAGA GCCGGGCGGA ACAGCAGCAG ATTATTCCGG TCGAGGTGGA AACGTTGGGG
CGGATGTGGC TTCATGCCGA TCTTCTTTCG CAGCTTGAAC CGGCGCTAAA TAACGCCTTA
AAAGCGACCC ATAGCGCAGT ACTGTCGCCT TTCGATCCTG TGGTATGGGA TCGCAAGCGG
GCAGCGCAGC TCTTCGGATT TAACTATCGG CTGGAATGTT ATACGCCTGC GGCGAAGCGC
CAGTACGGTT ATTTTGTGCT GCCGCTATTA TACCAGGGCC GTTTAGTCGG GCGAATGGAC
GCCAAAATGC ACCGTAAAAC GGGGGGACTT GAGGTTATCT CGCTGTATCT GGAGGACGAT
ATTCGTCCTG GCGTTAATCT GCAAAAAGGA ATCTGGCAGG CGATTAGCGC GTTTGCTGCC
TGGCAACGGG CATCGCGCGT GACGCTGGGA CAATGTCCGC CAGGCCTGTT TAGCGCCATG
CGTCATGGCT GGGAAATAGA CCCTGCACCA TAA
 
Protein sequence
MSLPYLSLSQ ARCLHLAAQG LLKKPRRNAM PGDVLAAISR MALLQIDTIN VVARSPYLVL 
FSRLGSYPQA WLDEALRRGE LIEYWAHEAC FLPRRDFKLI RHRMLSPEKM GWKYRAAWMH
EHAEEIEQLV RHIQEHGPVR SADFEHVQKG VSGWWEWKPH KRHLEGLFTA GKVMVVERRN
FQRVYDLTPR VMPHWDDGRD GLSQSQAESL MLDNSARSLG IFREQWLADY YRLKRPDLKG
WRESRAEQQQ IIPVEVETLG RMWLHADLLS QLEPALNNAL KATHSAVLSP FDPVVWDRKR
AAQLFGFNYR LECYTPAAKR QYGYFVLPLL YQGRLVGRMD AKMHRKTGGL EVISLYLEDD
IRPGVNLQKG IWQAISAFAA WQRASRVTLG QCPPGLFSAM RHGWEIDPAP