Gene SeD_A1974 details


Gene Information

Locus tag: SeD_A1974
Symbol:
ID: 6874296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1906125
End bp: 1907471
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 48%
IMG OID: 642785093
Product: dicarboxylate/amino acid:cation (Na+ or H+) symporter family protein
Protein accession: YP_002215759
Protein GI: 198242355
COG category: [R] General function prediction only
COG ID: [COG1823] Predicted Na+/dicarboxylate symporter
TIGRFAM ID:
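
The start, end, gene length, and protein length fields above agree with one another. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1906125, 1907471)
print(length)                              # 1347
print(expected_protein_length_aa(length))  # 448
```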


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.416802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.000000330732
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTGTTA CGCTCGCGTA TATCGCACTT TTTTTGGTTT TCTCGTGGGT CATCTTGAGA 
ATTAATCAAA AAAGCGATTC CCTGTCGAAA AGCGTTTTTA TCGCTATCTT TTTAGGGGCT
GTTATTGGTT TATCCCTGCA TTTTATTTCA GCAAATCACA CTAAAACTAT TATCGAATGG
TACAGCATCG TCGGCAATGG TTACGTCCAC CTGTTAAAAC TGGTCGCTAT ACCGCTAATT
TTTATTTCTA TTCTTTCCGC CATCAATAAA CTGGAAAATA GCGCCGGCAT CGGAAAAATG
TCGCTGACGA TCGTCGGATG CATGTTATGC CTGGTGATGG TTGCCGGTTT TATCGGATTA
CTGACCGCTC ATGTGCTGGG GCTTGACGCC AGCGCTTTTG TGCACATGCC GTCAATGTTA
ACTACAGAAG AAGTCAATAA GACTGCCGCG GTGTCGATTC CCCAGTTAGT GACATCGCTG
ATCCCGACTA ATATTTTTCT TGATCTTACG GGAGCCAGAA GTGTTTCCGT TATCGGCATC
GTTATCTTCA CGCTAATAGC GGGGATCGCT CTGTTAAAGG TCAAAAAAGA GGCGCCGGAA
GAAGGTCAGA AATTAAGCGC AGGCATTAAC GCTATCCAGA TCTGGGTCAT GAAGATGGTA
CGTATCGTTA TTGCATTAAC GCCCTATGGC GTCATGGCAT TAATGACTAC CGTATTTTCA
TCATACCACT TTGAACAATT CGCCAGCTTA CTTGGTTTTA TCGGCGCCTG TTATATCGCG
ATTTTCATGA TGTTTATCGT GCATGCCATC TTGCTGATCC TCAGCGGTAA TAATCCAGCG
CGTTATTTCA GTATGGTCTG GCCCGTCTTA ACCTTTGCGT TTGTTTCCCG CAGCAGCGCA
GCCTCTATCC CGCTGGCCAT TTCCGCCCAG GAAAAATTTG GCGTACAAAG CACTATTGCC
AATATTTCCG CCTCGTTTGG CTCCAGTATG GGGCAAAATG GCTGCGCCGG GATTTATCCG
GCTATTATGG TAGCGATGAT TGCGCCCACC ATCGGCATTG ATCCGCTCTC GCTGCATTTT
CTGGCCGCGA TGTTGCCTGC CATTGCGCTA GGGTCTATTG GCGTAGCCGG CGTCGGCGGC
GGCGGTACGT TCGCGGCGCT GATTGTCCTG TCGACGTTGA ATTTTCCCGT TGCGCTGGTC
GGTATTTTTA TTGCCATCGA ACCTATCGTT GATATGGCCC GCACGGCTCT GAACGTTAAC
GGATCGATGA TGTCAGGTGT GCTGGCTAAC CGTATTTTGA ATAATCATAC GGCTGACGAC
ATGCCAGCGG TTATTGACAG ACCTTAG
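
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before the sequence can be measured. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. It assumes GC content means the share of G and C bases over the full sequence length; the function names are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGGTTGTTA CGCTCGCGTA TATCGCACTT TTTTTGGTTT TCTCGTGGGT CATCTTGAGA"
print(f"{gc_percent(first_row):.1f}% GC over {len(clean_sequence(first_row))} bp")
```

Passing the whole sequence block as one string gives a figure that can be compared with the reported GC content.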
 
Protein sequence
MVVTLAYIAL FLVFSWVILR INQKSDSLSK SVFIAIFLGA VIGLSLHFIS ANHTKTIIEW 
YSIVGNGYVH LLKLVAIPLI FISILSAINK LENSAGIGKM SLTIVGCMLC LVMVAGFIGL
LTAHVLGLDA SAFVHMPSML TTEEVNKTAA VSIPQLVTSL IPTNIFLDLT GARSVSVIGI
VIFTLIAGIA LLKVKKEAPE EGQKLSAGIN AIQIWVMKMV RIVIALTPYG VMALMTTVFS
SYHFEQFASL LGFIGACYIA IFMMFIVHAI LLILSGNNPA RYFSMVWPVL TFAFVSRSSA
ASIPLAISAQ EKFGVQSTIA NISASFGSSM GQNGCAGIYP AIMVAMIAPT IGIDPLSLHF
LAAMLPAIAL GSIGVAGVGG GGTFAALIVL STLNFPVALV GIFIAIEPIV DMARTALNVN
GSMMSGVLAN RILNNHTADD MPAVIDRP
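
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits; because this gene begins with ATG, alternative start codons are not handled. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same codon-to-residue
# assignments as the standard code; "*" marks a stop codon.
BASES = "TCAG"
RESIDUES = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), RESIDUES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last stretches of the gene sequence in this record.
print(translate("ATGGTTGTTA CGCTCGCGTA TATCGCACTT"))  # MVVTLAYIAL
print(translate("ATGCCAGCGG TTATTGACAG ACCTTAG"))     # MPAVIDRP
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.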