Gene SeAg_B4039 details

Gene Information

Locus tag: SeAg_B4039
Symbol:
ID: 6796084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 3935465
End bp: 3936691
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 58%
IMG OID: 642778155
Product: putative cytoplasmic protein
Protein accession: YP_002148749
Protein GI: 197247916
COG category:
COG ID:
TIGRFAM ID:
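
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3935465
end_bp = 3936691

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame (the record says
# the gene is not spliced) and the last codon is a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the arithmetic reproduces the listed gene length of 1227 bp and protein length of 408 aa.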


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAA TCACCTTTAC GCCGCGCCAT CACCAGCTTA CCAACACCAA TACCTGGACA 
CCTGACAGCC AGTGGCTGGT CTTTGATGTA CGGCCTTCAG GCGCGTCATT TACGGGCAAG
ACTATTGAGC GTGTCAATGT GCATACCGGC GACGTGGAGG TGATTTATCG CGCCGTGCAG
GGCGCGCATG TCGGCGTGGT GACGGTGCAT CCTGCCGACA ATCACTATGT GTTTATTCAT
GGCCCTGAAA ACCCTGATGA GACGTGGCAT TACGATTTCC ACCACCGCCG GGGCGTTATT
GCAACGCCGG GGGGCGTGAC TAACCTCGAT GCGATGGATA TTACTGCGCC GTATACTCCC
GGCGCGCTGC GCGGCGGCAG TCACGTCCAT GTATTTAGCC CGAACGGCGA GCTGGTGAGT
TTTACCTATA ATGACCACGT TCTGCATGAG CGAGATCCGG CGCTGGATCT ACGTAATGTC
GGCGTGGCCG TGCCGTATGG GCCGGTAACG GTGCCGATTC AGCATGCGCG CGAATACAGC
GGTAGCTACT GGTGCGTACT GGTTAGCCGC ACGACGCCTG CGCCGAGACC TGGCAGCGAT
GACATTAACC GCGCCTATGA AGAGGGCTGG GTGGGCAACA GCCAGATCGC CTTTATTGGC
GATACGCTGT CGCTGACGGG CAAAAAAGTC CCGGAGCTGT TTATTGTCGA TTTACCGTGT
CATGAAAACA GCTGGAAACA GGCAGGCGAC ACGCCGCTGA CGGGAACCGA ATCAACGATG
CCATCGCCGC CGTTGGGCGT AGTTCAGCGG CGTCTTACCT TCACTCACCA GCGTGTTTAT
CCTGGACTCA CTAACGAACC ACGCCACTGG GTCCGCAGTA ATCCGCAGGC GACGGCGATC
GCCTTTTTGA TGCGGGACGA TAACGGCGTA GCGCAACTGT GGCTGATATC GCCTCAGGGA
GGCGAGCCCC GGCAGTTAAC GCATCATGCG ACCGGCGTAC AGTCAGCGTT TAACTGGCAT
CCGTCGGGTA AATGGCTGGG ACTGGTGCTG GAAAACCGGA TTGCCTGCTG CGACGCACAA
AGTGGGAGGA TCGACTTCTT AACCGCCAGG CACGACAACC CGCCGTCTGC GGACGCCGTC
GTTTTTTCAC CGGATGGGCG ACACGTCGCA TGGATGGAAG AGGTGAAAGG GTTCCGTCAG
CTATGGGTGA CGGAAACCGG GCGATAA
 
Protein sequence
MKQITFTPRH HQLTNTNTWT PDSQWLVFDV RPSGASFTGK TIERVNVHTG DVEVIYRAVQ 
GAHVGVVTVH PADNHYVFIH GPENPDETWH YDFHHRRGVI ATPGGVTNLD AMDITAPYTP
GALRGGSHVH VFSPNGELVS FTYNDHVLHE RDPALDLRNV GVAVPYGPVT VPIQHAREYS
GSYWCVLVSR TTPAPRPGSD DINRAYEEGW VGNSQIAFIG DTLSLTGKKV PELFIVDLPC
HENSWKQAGD TPLTGTESTM PSPPLGVVQR RLTFTHQRVY PGLTNEPRHW VRSNPQATAI
AFLMRDDNGV AQLWLISPQG GEPRQLTHHA TGVQSAFNWH PSGKWLGLVL ENRIACCDAQ
SGRIDFLTAR HDNPPSADAV VFSPDGRHVA WMEEVKGFRQ LWVTETGR
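
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code for sense and stop codons; table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG. The function name `translate` and the table-building code are illustrative, not part of the record.

```python
from itertools import product

# Codon table in NCBI order: each base position cycles through T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence, grouped as in the record.
print(translate("ATGAAACAAA TCACCTTTAC G"))
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GGGCGATAA"))
```

The first call yields the opening residues of the protein sequence (MKQITFT) and the second yields its final two residues (GR), with the stop codon dropped.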