Gene SeHA_C4784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4784 
SymbolpurA 
ID6490824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4663035 
End bp4664333 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content54% 
IMG OID642744836 
Productadenylosuccinate synthetase 
Protein accessionYP_002048409 
Protein GI194451596 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0104] Adenylosuccinate synthase 
TIGRFAM ID[TIGR00184] adenylosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0521229 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value0.599456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACA ACGTCGTCGT ACTGGGCACC CAATGGGGTG ACGAAGGTAA AGGAAAGATC 
GTCGATCTTC TGACTGAACG GGCTAAATAT GTTGTACGCT ACCAGGGCGG TCACAACGCA
GGCCATACTC TCGTAATCAA CGGTGAAAAA ACCGTTCTCC ATCTTATTCC ATCAGGTATT
CTTCGCGAGA ATGTAACCAG CATCATCGGT AACGGTGTTG TGCTGTCTCC GGCCGCGCTG
ATGAAAGAGA TGAAAGAACT GGAAGACCGT GGCATCCCCG TTCGTGAGCG TCTGCTGCTG
TCTGAAGCCT GTCCGCTGAT CCTTGATTAT CACGTTGCGC TGGATAACGC GCGTGAGAAA
GCGCGTGGCG CGAAAGCGAT CGGCACCACC GGTCGTGGAA TCGGGCCTGC TTATGAAGAT
AAAGTGGCAC GTCGCGGTCT GCGTGTTGGT GACCTTTTCG ACAAAGAAAC CTTCGCTGAA
AAACTGAAAG AAGTGATGGA ATATCACAAC TTCCAGTTGG TTAACTACTA CAAAGCTGAA
GCGGTTGATT ACCAGAAAGT TCTGGATGAT ACGATGGCTG TTGCCGACAT CCTGACTTCT
ATGGTTGTTG ACGTTTCGGA CCTGCTCGAC CAGGCGCGTC AGCGTGGCGA TTTCGTCATG
TTTGAAGGTG CGCAGGGTAC CCTGCTGGAT ATCGACCACG GTACTTATCC GTACGTAACT
TCTTCTAACA CCACTGCAGG TGGCGTGGCG ACCGGTTCCG GCCTGGGCCC GCGTTATGTT
GATTACGTTC TGGGTATCCT CAAAGCTTAC TCCACTCGCG TAGGTGCGGG TCCGTTCCCG
ACCGAACTGT TTGATGAAAC CGGCGAGTTC CTCTGCAAGC AGGGTAACGA ATATGGCGCT
ACTACCGGCC GTCGTCGTCG TACCGGCTGG CTGGACACCG TTGCCGTTCG TCGTGCGGTA
CAGCTGAACT CCCTGTCTGG CTTCTGCCTG ACCAAACTGG ACGTGCTGGA TGGCCTGAAA
GAGGTGAAAC TCTGCGTGGC TTACCGTATG CCGGATGGTC GCGAAGTGAC TACCACTCCG
CTGGCAGCTG ACGACTGGAA AGGTGTAGAG CCGATTTACG AAACCATGCC GGGCTGGTCT
GAATCCACCT TCGGCGTGAA AGATCGTAGC GGTCTGCCGC AGGCGGCGCT GAACTACATC
AAGCGTATTG AAGAACTGAC CGGCGTGCCG ATTGATATTA TTTCTACCGG CCCCGATCGT
ACTGAGACGA TGATTCTGCG CGACCCGTTC GACGCGTAA
 
Protein sequence
MGNNVVVLGT QWGDEGKGKI VDLLTERAKY VVRYQGGHNA GHTLVINGEK TVLHLIPSGI 
LRENVTSIIG NGVVLSPAAL MKEMKELEDR GIPVRERLLL SEACPLILDY HVALDNAREK
ARGAKAIGTT GRGIGPAYED KVARRGLRVG DLFDKETFAE KLKEVMEYHN FQLVNYYKAE
AVDYQKVLDD TMAVADILTS MVVDVSDLLD QARQRGDFVM FEGAQGTLLD IDHGTYPYVT
SSNTTAGGVA TGSGLGPRYV DYVLGILKAY STRVGAGPFP TELFDETGEF LCKQGNEYGA
TTGRRRRTGW LDTVAVRRAV QLNSLSGFCL TKLDVLDGLK EVKLCVAYRM PDGREVTTTP
LAADDWKGVE PIYETMPGWS ESTFGVKDRS GLPQAALNYI KRIEELTGVP IDIISTGPDR
TETMILRDPF DA