Gene SeHA_C0629 details

Gene Information

Locus tag: SeHA_C0629
Symbol: ncs1
ID: 6489491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 629673
End bp: 631127
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 46%
IMG OID: 642740887
Product: allantoin permease
Protein accession: YP_002044554
Protein GI: 194447489
COG category: [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1953] Cytosine/uracil/thiamine/allantoin permeases
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
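
The coordinates and lengths listed above appear to be mutually consistent. The sketch below checks the arithmetic, assuming the start and end positions are inclusive 1-based coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 629673
end_bp = 631127

# Assumption: coordinates are inclusive, so the span includes both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1455
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 484
```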


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACATC AAAGAGAGCT ATACCAGCAG CGCGGTTATA GCGAAGACTT ATTACCTAAA 
ACAGAAACCC AACGAAACTG GAAGGCATTT AACTATTTCA CCTTATGGAT GGGATCTGTA
CATAACGTGC CAAATTACGT TATGGTCGGC GGTTTTTTTA TACTGGGCCT ATCAACATTT
AATATCATGT TGGCCATTAT TATCAGCGCA TTATTTATTG CGGCGGCGAT GGTAATGAAT
GGCGCGGCAG GCAGCAAATA TGGCGTTCCT TTTGCTATGA TATTGCGAGG TTCTTACGGC
GTCCGCGGCG CGCTATTCCC TGGATTATTG CGAGGGGGAA TCGCGGCAAT TATGTGGTTC
GGCTTACAGT GTTACGCGGG ATCGCTGGCA TTTCTTATTT TGATTGGGAA GATCTGGCCA
GGATTTCTGA CATTAGGCGG AGATTTCAAG CTGCTGGGTC TTTCACTGCC AGGACTAATT
ACTTTTCTAA TTTTTTGGAT TATTAACGTT GGCATCGGTT TTGGCGGTGG TAAAGTATTA
AATAAATTTA CCGCTATCCT CAATCCATGT ATTTACATTG TCTTTGGCGG CATGGCTATT
TGGGCAATAT CGCTGGTCGG CATTGGCCCG ATTCTGGACT ATCTGCCTTC AGGCGTGCAA
AAAGCAGAGC ACAGCGGCTT TCTGTTCCTG GTGGTGATTA ACGCCGTAGT CGCCGTCTGG
GCTGCGCCAG CGGTGAGCGC GTCCGATTTC ACGCAAAACG CGTATTCATT TCGCGCTCAG
GCATTGGGAC AAACGCTGGG GCTTATCGTA GCGTATATAT TATTCGCCGT AGCCAGCGTG
TGCATTATTG CCGGAGCCAG TATTCATTAT GGTATGGATA CCTGGAACGT GCTGGATATT
GTGCAGCGCT GGGACAGCCT GTTTGCTTCA TTCTTTGCGG TGCTGGTGAT TCTGATGACG
ACAATTTCAA CCAACGCCAC CGGTAATATT ATACCTGCGG GGTATCAAAT TGCGGCGCTT
GCCCCGACAA AGCTTAACTA TAAAAATGGC GTAATGATTG CCAGTATTAT CAGTCTACTG
ATTTGCCCAT GGAAATTAAT GGAGAATCAG GACAGTATTT ATCTGTTCCT CGATATTATC
GGCGGTATGC TTGGCCCGGT AATCGGCGTA ATGTTAGCTC ATTATTTTGT GGTAATGCGT
GGAAAAATTA ATCTTGATGA GCTGTACACC GCCAGCGGTG ATTACAAATA TTATGATAAT
GGATTTAACC TGACTGCATT TTCAGTCACC CTGGTCGCAG TCATTCTGTC ATTAGGCGGC
AAGTTTATAC CATTTATGGA GCCTTTATCC CGCGTGTCCT GGTTTGTAGG CGTAATAGTT
GCGTTCGTTG CTTATGCGCT ATTGAAGAAA CGCACGGGTT TTGAAAATAC AGGAGAGCAA
AAACTCGTAG GTTAA
 
Protein sequence
MEHQRELYQQ RGYSEDLLPK TETQRNWKAF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF 
NIMLAIIISA LFIAAAMVMN GAAGSKYGVP FAMILRGSYG VRGALFPGLL RGGIAAIMWF
GLQCYAGSLA FLILIGKIWP GFLTLGGDFK LLGLSLPGLI TFLIFWIINV GIGFGGGKVL
NKFTAILNPC IYIVFGGMAI WAISLVGIGP ILDYLPSGVQ KAEHSGFLFL VVINAVVAVW
AAPAVSASDF TQNAYSFRAQ ALGQTLGLIV AYILFAVASV CIIAGASIHY GMDTWNVLDI
VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAL APTKLNYKNG VMIASIISLL
ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MLAHYFVVMR GKINLDELYT ASGDYKYYDN
GFNLTAFSVT LVAVILSLGG KFIPFMEPLS RVSWFVGVIV AFVAYALLKK RTGFENTGEQ
KLVG
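
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the gene sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation codons. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool, and only the first and last blocks of the gene sequence are used as sample input.

```python
# Codon assignments enumerated in TCAG order. Table 11 differs from the
# standard code only in its permitted start codons (an assumption that
# does not matter here, since this gene starts with ATG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_blocks = "ATGGAACATC AAAGAGAGCT ATACCAGCAG"  # start of the gene sequence
last_blocks = "AAACTCGTAG GTTAA"                   # end of the gene sequence

print(translate(first_blocks))  # MEHQRELYQQ
print(translate(last_blocks))   # KLVG
```

Both results match the first and last residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content field in the record.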