Gene SeD_A2952 details


Gene Information

Locus tag: SeD_A2952
Symbol:
ID: 6875380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2846955
End bp: 2848151
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 46%
IMG OID: 642785991
Product: major facilitator family transporter
Protein accession: YP_002216641
Protein GI: 198245286
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
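
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2846955
end_bp = 2848151
reported_gene_length = 1197      # bp
reported_protein_length = 398    # aa


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length):
    """Number of codons minus one terminal stop codon (assumes a complete CDS)."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length == reported_gene_length)        # coordinates vs. Gene Length
print(protein_length == reported_protein_length)  # Gene Length vs. Protein Length
```

Under these assumptions, both comparisons agree with the reported values (1197 bp and 398 aa).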


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 91
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCAT CAATCAACCG CTGGGGAATG CTTGCCGCTC ACGTGTGCAT CAATTTTGTG 
CTCGGGGGCG TCTACGCATT TAGCTATTTC AAAACACCAC TCATGGCGCA ATATCACTGG
GATCCGGCTA CGCTGGCGTT AGCATTCTCT ATCAATATGG GGATCATTCC TTTACCGATG
ATTTGGGGCG GGAGAATGAT CGACAATGGT AAAGGAAAGC AGGCGATAGT TATCGGTGGT
ATCCTGTTTT CTTTAGGTTT TATCTTGTCC GGGTTTGTGG ATAATTTGCC CATGCTGTTT
TTAACCTACG GCGTCATTGC CGGGTTGGGA TCGGGCCTGG CTTTTACGGG TAATCTTAAT
AATATTCTGA AATTTTTCCC TGACCGTCGC GGTCTTGCCA GCGGTATCGT ACTGGCGGGT
GTTGGCGTCG GGACGCTACT TTGCACCCGC CTGGCCGAAT ATTTTATGGC GCAAACTCAC
GATGTTAGTC GGGCGTTGTT ATATCTGGGT ATTGTTTATC TGGTGGTTAT TTTTATCGTC
CAGTTCTTTA TTCGTAGCGC GCCAGCAAAA GATAGCGGAG GAATTAAAGC GTCGCCACTG
GATAAAGACT ATCGGCACAT GCTGAAAGAT CTGCGCTTCT GGCTGCTGTT TATGATTCTG
GCGCTGGGCG TGTTCTCTGG GATGGTAATT AGCTCAAGCT CTGCGCAAAT TGGTATGACG
CAGTACGGTT TACTGTCCGG TGCATTAGTC GTTAGCCTGG TCTCGATATT TAACTCGATC
GGTCGCCTGT TCTGGGGAGG GTTAACCGAT AAATTAGGCG GCTATAATAC GCTGGTTATT
GTTTACCTCT TTACCTGCTT GTGTATGCTG CTGTTATTTT TCTTCAATGG CAATACTTCG
GTATTTTATT TCAGCGCTCT GGGCGTGGGC TTTGCTTATG CCGGTATATT AGTTATCTTC
CCTGGTTTGA CCAGCCAGAA TTTTGGTATG CGTAACCAGG GGCTAAACTA CGGCTTTATG
TATTTTGGTT TTGCCGTCGG TGCGGTTATT GCTCCTTATG TAACGTCCGC TATTGCAAAA
TATACCGGAA GCTACAATAC AGTATTTATT TTGACAACGG TGCTATTGCT TATTGGAGTC
GTGTTGACCC TGATAACGAA AAAATATGTC GCAACGGTTT TAGCCAAAAT TCATTAA
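
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch: `gc_percent` is a hypothetical helper, and the rounding convention used by the source database is not known, so the result may differ slightly from the reported value.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring whitespace between 10-base groups."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example call on the first line of the gene sequence only;
# pass the full 1197 bp sequence to compare with the reported figure.
first_line = "ATGTCGGCAT CAATCAACCG CTGGGGAATG CTTGCCGCTC ACGTGTGCAT CAATTTTGTG"
print(round(gc_percent(first_line), 1))
```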
 
Protein sequence
MSASINRWGM LAAHVCINFV LGGVYAFSYF KTPLMAQYHW DPATLALAFS INMGIIPLPM 
IWGGRMIDNG KGKQAIVIGG ILFSLGFILS GFVDNLPMLF LTYGVIAGLG SGLAFTGNLN
NILKFFPDRR GLASGIVLAG VGVGTLLCTR LAEYFMAQTH DVSRALLYLG IVYLVVIFIV
QFFIRSAPAK DSGGIKASPL DKDYRHMLKD LRFWLLFMIL ALGVFSGMVI SSSSAQIGMT
QYGLLSGALV VSLVSIFNSI GRLFWGGLTD KLGGYNTLVI VYLFTCLCML LLFFFNGNTS
VFYFSALGVG FAYAGILVIF PGLTSQNFGM RNQGLNYGFM YFGFAVGAVI APYVTSAIAK
YTGSYNTVFI LTTVLLLIGV VLTLITKKYV ATVLAKIH
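
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table in plain Python, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which start codons are allowed). The `translate` function is illustrative, stops at the first stop codon, and raises `KeyError` on any base other than A, C, G, or T.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence (both in frame).
first_30 = "ATGTCGGCAT CAATCAACCG CTGGGGAATG"
last_line = "GTGTTGACCC TGATAACGAA AAAATATGTC GCAACGGTTT TAGCCAAAAT TCATTAA"

print(translate(first_30))   # MSASINRWGM
print(translate(last_line))  # VLTLITKKYVATVLAKIH
```

Both fragments agree with the start and end of the protein sequence listed above.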