Gene SeD_A2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2079 
Symbol 
ID6871510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2010900 
End bp2012084 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content58% 
IMG OID642785191 
Productinner membrane transport protein YeaN 
Protein accessionYP_002215857 
Protein GI198243371 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2807] Cyanate permease 
TIGRFAM ID[TIGR00896] cyanate transporter 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.000109416 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACAG CCCTTTCACC ACGCGGCAAA CAGGGCGCTT TACTGATTGC CGGTATTTTG 
ATGATAGCCA CTACGCTGCG CGTAACATTT ACCGGCGCTG CGCCGTTGCT GGAGACAATC
CGTTCGGATT ACGGTCTTTC CACCGCCCAG ACGGGCCTGC TAACAACCTT GCCGCTTCTG
GCGTTTGCGC TGGTTTCGCC GCTGGCGGCC GGTATTGCCC GCCGGTTTGG GATGGAACGC
AGTCTGTTTG CCGCAATGCT GCTGATTTGC GCGGGTATTG CCCTCCGCTC CCTGCCCTCC
GCTGCATTGT TGTTTGCCGG AACTGCCATT ATTGGGTGCG GGATAGCGCT GGGTAATGTG
CTGCTACCAG GATTGATTAA GCGCGATTTT TCACAACATG TCGCCAGGCT GACGGGCGCG
TATTCATTGA CGATGGGCGC CGCCGCCGCG TTGGGGTCTG CGCTAGTGGT TCCGCTGGCG
TTGCACGGTT TTGGCTGGCG CGGCGCGCTG TTAATGCTGA TGCTGTTTCC GCTGTTGGCG
TTTCTTATCT GGTTGCCGCA GTGGCGCACT ACCCGCTCAG CTAACCTGAG TAGCTCCCGG
GCATTACATG AACGCGGTAT CTGGCGTTCG CCTTTAGCCT GGCAAGTTAC GTTGTTTTTG
GGACTTAACT CACTGATTTA TTATGTAATT ATCGGCTGGT TACCAACAAT ACTCATCAGC
CACGGTTACA GCGAAGCACA GGCAGGATCG CTACATGGCC TGCTGCAACT GGCGACCGCG
GCGCCTGGGT TAGCCATTCC GCTTATTTTG CCTCGCTTTA ACGATCAACG CTGGATTGCC
GCGCTGGTCT CACTCTTGTG CGCAGTGGGC GCGGCGGGGC TTTGGTTTGT GCCGGGCCAG
GCGATCATCT GGACGCTACT GTTCGGCTTC GGTACAGGCG CGACGATGAT CCTCGGCCTG
ACGTTTATCG GCCTGCGCGC CAGTTCGGCG CATCAGGCGG CGGCGCTTTC CGGCATGGCG
CAGTCGGTCG GGTATTTACT GGCGGCATGT GGGCCGCCCG TCATGGGAAA ACTTCATGAT
GCCAGCGGTA GCTGGTATCT GCCGCTATCG GGCGTAACGG TTCTGGCTAT CATCATGGCG
ATTTTAGGCC TCTACGCCGG ACGCGATCGA GAGATAGCGT CATAA
 
Protein sequence
MTTALSPRGK QGALLIAGIL MIATTLRVTF TGAAPLLETI RSDYGLSTAQ TGLLTTLPLL 
AFALVSPLAA GIARRFGMER SLFAAMLLIC AGIALRSLPS AALLFAGTAI IGCGIALGNV
LLPGLIKRDF SQHVARLTGA YSLTMGAAAA LGSALVVPLA LHGFGWRGAL LMLMLFPLLA
FLIWLPQWRT TRSANLSSSR ALHERGIWRS PLAWQVTLFL GLNSLIYYVI IGWLPTILIS
HGYSEAQAGS LHGLLQLATA APGLAIPLIL PRFNDQRWIA ALVSLLCAVG AAGLWFVPGQ
AIIWTLLFGF GTGATMILGL TFIGLRASSA HQAAALSGMA QSVGYLLAAC GPPVMGKLHD
ASGSWYLPLS GVTVLAIIMA ILGLYAGRDR EIAS