Gene SeHA_C2840 details

Gene Information

Locus tag: SeHA_C2840
Symbol:
ID: 6489043
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2776102
End bp: 2777298
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 46%
IMG OID: 642743009
Product: major facilitator family transporter
Protein accession: YP_002046636
Protein GI: 194448835
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
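
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed values) and that the final codon of an unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2776102, 2777298)
print(length, protein_length_aa(length))  # 1197 398
```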


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.638107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value: 0.599456
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCAT CAATCAACCG CTGGGGAATG CTTGCCGCTC ACGTGTGCAT CAATTTTGTG 
CTCGGGGGCG TCTACGCATT TAGCTATTTC AAAACACCAC TCATGGCGCA ATATCACTGG
GATCCGGCTA CGCTGGCGTT AGCATTCTCT ATCAACATGG GGATCATTCC TTTACCGATG
ATTTGGGGTG GGAGAATGAT CGACAATGGT AAAGGAAAGC AGGCGATAGT TATCGGCGGT
ATCCTGTTTT CTTTAGGTTT TATCTTGTCC GGGTTTGTGG ATAATTTGCC CATGCTGTTT
TTAACCTACG GCGTCATTGC CGGGTTGGGA TCGGGCCTGG CTTTTACGGG TAATCTTAAT
AATATTCTGA AATTTTTCCC TGACCGTCGC GGTCTTGCCA GCGGTATCGT ACTGGCGGGT
GTTGGCGTCG GAACGCTACT TTGCACCCGC CTGGCCGAAT ATTTTATGGC GCAAACTCAT
GATGTTAGTC GGGCGTTGTT ATATCTGGGT ATTGTTTATC TGGTGGTTAT TTTTATCGTC
CAGTTCTTTA TTCGTAGCGC GCCAGCAAAA GATAGCGGAG GAATTAAAGC CTCGCCACTG
GATAAAGACT ATCGGCACAT GCTGAAAGAT CTGCGCTTCT GGCTGCTGTT TATGATTCTG
GCGCTGGGCG TGTTCTCTGG GATGGTAATT AGCTCAAGTT CTGCGCAAAT TGGTATGACG
CAGTACGGTT TACTGTCCGG TGCATTAGTC GTTAGCCTGG TCTCGATATT TAACTCGATC
GGTCGCCTGT TCTGGGGAGG GTTAACCGAT AAATTAGGCG GCTATAATAC GCTGGTTATT
GTTTATCTTT TTACCTGCGT CTGTATGCTG CTGCTGTTGT TCTTCAACGG TAATACTTCG
GTATTCTATT TCAGCGCTCT GGGCGTGGGC TTTGCTTATG CCGGTATATT AGTTATCTTC
CCTGGCTTGA CCAGCCAGAA TTTTGGTATG CGTAACCAGG GACTAAACTA CGGCTTTATG
TATTTTGGTT TTGCCGTCGG TGCGGTTATT GCTCCTTACG TCACGTCCGC TATTGCAAAA
TATACCGGAA GCTACAATAC AGTATTTATT TTGACAACGG TGCTATTGCT TATTGGAGTC
GTGTTGACCC TGATAACGAA AAAATATGTC GCAACGGTTT TAGCCAAAAT TCATTAA
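
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content reported in the gene information. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database's own GC calculation (for example its rounding) is not documented here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, as a small demonstration.
fragment = clean_sequence("ATGTCGGCAT CAATCAACCG")
print(len(fragment), gc_fraction(fragment))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives the whole-gene value to compare against the listed GC content.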
 
Protein sequence
MSASINRWGM LAAHVCINFV LGGVYAFSYF KTPLMAQYHW DPATLALAFS INMGIIPLPM 
IWGGRMIDNG KGKQAIVIGG ILFSLGFILS GFVDNLPMLF LTYGVIAGLG SGLAFTGNLN
NILKFFPDRR GLASGIVLAG VGVGTLLCTR LAEYFMAQTH DVSRALLYLG IVYLVVIFIV
QFFIRSAPAK DSGGIKASPL DKDYRHMLKD LRFWLLFMIL ALGVFSGMVI SSSSAQIGMT
QYGLLSGALV VSLVSIFNSI GRLFWGGLTD KLGGYNTLVI VYLFTCVCML LLLFFNGNTS
VFYFSALGVG FAYAGILVIF PGLTSQNFGM RNQGLNYGFM YFGFAVGAVI APYVTSAIAK
YTGSYNTVFI LTTVLLLIGV VLTLITKKYV ATVLAKIH
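
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI base ordering (TCAG); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. Because this gene starts with ATG, that difference does not matter here, but the sketch does not handle alternative start codons. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First blocks of the gene sequence above; the listed protein begins MSASINR.
print(translate("ATGTCGGCAT CAATCAACCG C"))  # MSASINR
```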