Gene SeAg_B4814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4814 
Symbol 
ID6792971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4692627 
End bp4693955 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content45% 
IMG OID642778879 
Productprotein HipA 
Protein accessionYP_002149440 
Protein GI197248365 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGAA CACAGCAGCG TTTATCAATA TGGATGAATG GAATCCGGGT CGGATTCTGG 
GAGAAGGCCA GAGGCGAGGA TTTATTACAA TACCTTCCAG AATGGATAAT TGATGAACAG
GGAAGACCTT TATCGCTTTC TTTGCCTTTC ACTCCAGGTA ATCAGCTTTG GCGTGGTAAT
GTTGTTCGTG ACTATTTTGA TAATTTATTG CCTGACAGCG AAAGTATACG CAGACGTTTA
GCCGTGCGTT ACCAGGCTGA AAGCCTTGAG CCTTTTGATC TATTGGCTGA GCTGGGAAGA
GACTGCGTTG GTGCAATACA GTTACTGAAT GTTGATGAAG AGCCCACAGA TTTATTTTCC
GTAAATTATC GCCCACTTTC TGAAGCTGAT ATCGCAACTA CATTGCGTAA TACTACGGCG
ATATCGTTGC CTGGTCGGCA GGACGAAACT GACGATTTGC GATTATCAAT TGCCGGTGCG
CAGGAAAAAA CGGCTTTATT GTGGCATGAA GAACGATGGT GTTTACCTGA AGGTAATACC
CCAACAACGC ATATCTTCAA ACTACCGCTT GGGTTGGTTG GGAACATGCA AGCGGATATG
AGTACATCGG TTGAAAATGA ATGGCTGTGT TCTTTGCTTG TTGAGCACTA CGGGATCCCT
GTAGCAAAAA CACAGATTGC GCAGTTTGAG GATCAGAAGG CATTAGTAGT TGAGCGTTTC
GACAGAAGAT GGTCAGGCGA TCGGCAATGG ATCATTCGTT TGCCACAAGA GGATATGTGT
CAGGCTTTAG GTGTTTCTCC GTTACGAAAA TACCAGTCTG ATGGTGGGCC GGGTATTTCC
GATATTATGG AAATACTGAG TCATTCAGAT CAGGCTGAGC AGGACAAAGA GCAGTTCTTC
AGGGCTCAAA TTATTTTCTG GTTGATGGCT GCTACTGACG GCCATGCCAA AAATTTCAGT
ATCGCTATTG AGCCACAAAG TCGTTACCAC CTTACGCCTC TTTACGATAT TTTATCAGCA
TGGCCGGTAA TTGGTCATGG TAATAATCAG ATTTCCTGGC AAAGATGCAA ACTGGCAATG
GCTGTTCGCG GTAGCAGTAA TTATTACCAC ATTTATAGAG TTCAACGACG GCATTGGATT
AATCAAGGTG AATTAAACGG ATTGGGAAGA CGACAAGTTG AGTCCATGAT GGATGACATT
ATATCCAGCA CACCTGAAGT CATTGAGCGT GTATCTGCGT TGCTTCCAGA GTCGTTTCCA
TCTGAGCTTG CTGAGTGTAT TTTTGAAGGT ATGCGGCAGC AGTGTAGGCG TTTGGCTGGA
AGGGAATAA
 
Protein sequence
MRRTQQRLSI WMNGIRVGFW EKARGEDLLQ YLPEWIIDEQ GRPLSLSLPF TPGNQLWRGN 
VVRDYFDNLL PDSESIRRRL AVRYQAESLE PFDLLAELGR DCVGAIQLLN VDEEPTDLFS
VNYRPLSEAD IATTLRNTTA ISLPGRQDET DDLRLSIAGA QEKTALLWHE ERWCLPEGNT
PTTHIFKLPL GLVGNMQADM STSVENEWLC SLLVEHYGIP VAKTQIAQFE DQKALVVERF
DRRWSGDRQW IIRLPQEDMC QALGVSPLRK YQSDGGPGIS DIMEILSHSD QAEQDKEQFF
RAQIIFWLMA ATDGHAKNFS IAIEPQSRYH LTPLYDILSA WPVIGHGNNQ ISWQRCKLAM
AVRGSSNYYH IYRVQRRHWI NQGELNGLGR RQVESMMDDI ISSTPEVIER VSALLPESFP
SELAECIFEG MRQQCRRLAG RE