Gene SNSL254_A0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0852 
SymbolhutG 
ID6485788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp857412 
End bp858353 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content61% 
IMG OID642736264 
Productformimidoylglutamase 
Protein accessionYP_002040024 
Protein GI194444635 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAT GGTATCCGGC TTCTCCGGCG CTCTGGCAGG GGCGCGATGA CAGTATAGAA 
GCGCCGGATG CGCGGCGTCT GTTTCAGACC GTCACGCGCA GCGAGACCTT TTCCCCCGAA
AACTGGCAGC AAAAGATAGC GTTAATGGGA TTTGCCTGCG ACGAGGGGGT AAAACGCAAT
GCAGGGCGTC CCGGCGCGGC AGGCGGCCCG GACGCGTTGC GTAAAGCGCT GGCGAATATG
GCCAGCCACC AGGGACATGA ACGGCTGGTG GATTTAGGCA ATTGGGTTGC GCCGACGCCC
GATCTGGAAG GCGCGCAGCA GGCCTTGCGC AATGCGGTAA GCCGCTGTCT GCGGGCCGGG
ATGCGCACGC TGGTGCTGGG CGGCGGGCAT GAAACCGCGT TTGGACACGG CGCGGGGGTG
CTGGACGCGT TTGCGCAGGA AAGCGTAGGG ATCATTAATC TTGATGCGCA TCTGGATCTC
CGTCAGACCG ACCGGGCAAC ATCCGGGACG CCGTTTCGTC AACTGGCGCA GCTATGCGAC
GCGCAGAGCC GCGCGTTTCA TTATGCCTGT TTCGGCGTGA GCCGTGCGGC GAATACGCAG
GCGTTGTGGC GGGAAGCGCA GTGGCGGAAT GTTACCGTGG TGGAGGATCT TGACTGCCAT
GACGCGCTGG CGCAGATGAC GCAGTTTATC GACAAGGTGG ATAAAATTTA TCTGACTATC
GATCTCGACG TATTGCCTGT CTGGGAAATG CCGGCCGTCT CCGCTCCCGC AGCGCTGGGC
GTGCCGCTGA TACAGGTTCT GCGTTTAATT GAGCCGGTTT GCCGCAGCGG AAAATTACAG
GCGGCGGATC TGGTTGAATT TAATCCACGC TTTGATGAAG ATGGCGCAGC GGCGCGCGTG
GCGGCGCGGC TTGGCTGGCA AATCGCGCAC TGGTGGCGTT AA
 
Protein sequence
MTQWYPASPA LWQGRDDSIE APDARRLFQT VTRSETFSPE NWQQKIALMG FACDEGVKRN 
AGRPGAAGGP DALRKALANM ASHQGHERLV DLGNWVAPTP DLEGAQQALR NAVSRCLRAG
MRTLVLGGGH ETAFGHGAGV LDAFAQESVG IINLDAHLDL RQTDRATSGT PFRQLAQLCD
AQSRAFHYAC FGVSRAANTQ ALWREAQWRN VTVVEDLDCH DALAQMTQFI DKVDKIYLTI
DLDVLPVWEM PAVSAPAALG VPLIQVLRLI EPVCRSGKLQ AADLVEFNPR FDEDGAAARV
AARLGWQIAH WWR