Gene SeD_A0570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0570 
Symbol 
ID6872447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp585858 
End bp587312 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content46% 
IMG OID642783790 
Productallantoin permease 
Protein accessionYP_002214477 
Protein GI198243659 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.635464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.884181 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACATC AAAGAGAGCT ATACCAGCAG CGCGGTTATA GCGAAGACTT ATTACCTAAA 
ACAGAAACCC AACGAAACTG GAAGGCATTT AACTATTTCA CCTTATGGAT GGGATCTGTA
CATAACGTGC CAAATTACGT TATGGTCGGC GGTTTTTTTA TACTGGGCCT ATCAACATTT
AATATCATGT TGGCCATTAT TATCAGCGCA TTATTTATTG CGGCGGCGAT GGTAATGAAT
GGCGCGGCAG GCAGCAAATA TGGCGTTCCT TTTGCTATGA TATTGCGAGG TTCTTACGGC
GTCCGCGGCG CGCTATTCCC TGGATTATTG CGAGGGGGAA TCGCGGCAAT TATGTGGTTC
GGCTTACAGT GTTACGCGGG ATCGCTGGCA TTTCTTATTT TGATTGGGAA GATCTGGCCA
GGATTTCTGA CATTAGGCGG AGATTTCAAG CTGCTGGGTC TTTCACTGCC AGGACTAATT
ACTTTTCTAA TTTTTTGGAT TATTAACGTT GGCATCGGTT TTGGCGGTGG TAAAGTATTA
AATAAATTTA CCGCTATCCT CAATCCATGT ATTTACATTG TCTTTGGCGG CATGGCTATT
TGGGCAATAT CGCTGGTCGG CATTGGCCCG ATTCTGGACT ATCTGCCTTC AGGCGTGCAA
AAAGCAGAGC ACAGCGGCTT TCTGTTCCTG GTGGTGATTA ACGCCGTAGT CGCCGTCTGG
GCAGCGCCAG CGGTGAGCGC GTCCGATTTC ACGCAAAACG CGCATTCATT TCGCGCTCAG
GCATTGGGAC AAACGCTGGG GCTTATCGTA GCGTATATAT TATTCGCCGT AGCCAGCGTG
TGCATTATTG CCGGAGCCAG TATTCATTAT GGTATGGATA CCTGGAACGT GCTGGATATT
GTGCAGCGCT GGGACAGCCT GTTTGCTTCA TTCTTTGCGG TGTTGGTGAT TCTGATGACG
ACAATTTCAA CCAACGCCAC CGGTAATATT ATACCTGCGG GGTATCAAAT TGCGGCGCTT
GCCCCGACAA AGCTTAACTA TAAAAATGGC GTAATGATTG CCAGTATTAT CAGTCTACTG
ATTTGTCCAT GGAAATTAAT GGAGAATCAG GACAGTATTT ATCTGTTCCT CGATATTATC
GGCGGTATGC TTGGCCCGGT AATCGGCGTA ATGTTAGCTC ATTATTTTGT GGTAATGCGT
GGAAAAATTA ATCTTGATGA GCTGTACACC GCCAGCGGTG ATTACAAATA TTATGATAAT
GGATTTAACC TGACTGCATT TTCAGTCACC CTGGTCGCAG TTATTCTGTC ATTAGGCGGC
AAGTTTATAC CATTTTTGGA GCCTTTATCC CGCGTGTCCT GGTTTGTAGG CGTAATAGTT
GCGTTCGTTA CTTATGCGCT ATTGAAGAAA CGCACGGGTT TTGAAAATAC AGGAGAGAAA
AAACTCGCAG GTTAA
 
Protein sequence
MEHQRELYQQ RGYSEDLLPK TETQRNWKAF NYFTLWMGSV HNVPNYVMVG GFFILGLSTF 
NIMLAIIISA LFIAAAMVMN GAAGSKYGVP FAMILRGSYG VRGALFPGLL RGGIAAIMWF
GLQCYAGSLA FLILIGKIWP GFLTLGGDFK LLGLSLPGLI TFLIFWIINV GIGFGGGKVL
NKFTAILNPC IYIVFGGMAI WAISLVGIGP ILDYLPSGVQ KAEHSGFLFL VVINAVVAVW
AAPAVSASDF TQNAHSFRAQ ALGQTLGLIV AYILFAVASV CIIAGASIHY GMDTWNVLDI
VQRWDSLFAS FFAVLVILMT TISTNATGNI IPAGYQIAAL APTKLNYKNG VMIASIISLL
ICPWKLMENQ DSIYLFLDII GGMLGPVIGV MLAHYFVVMR GKINLDELYT ASGDYKYYDN
GFNLTAFSVT LVAVILSLGG KFIPFLEPLS RVSWFVGVIV AFVTYALLKK RTGFENTGEK
KLAG