Gene SeD_A4158 details

Gene Information

Locus tag: SeD_A4158
Symbol:
ID: 6873607
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 4004459
End bp: 4005406
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 53%
IMG OID: 642787095
Product: carboxylate/amino acid/amine transporter
Protein accession: YP_002217721
Protein GI: 198244739
COG category: [R] General function prediction only
COG ID: [COG5006] Predicted permease, DMT superfamily
TIGRFAM ID: [TIGR00950] Carboxylate/Amino Acid/Amine Transporter
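
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 4004459
end_bp = 4005406

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 948 315
```

Under those assumptions the result agrees with the reported gene length (948 bp) and protein length (315 aa).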


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.565076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.615657
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCTG CACGTAAAAA AATAAAACAA CATGACGAGG AAAAAATGGG TTCCACCAGA 
AAAGGGATGC TAAACGTCCT GATTGCCGCC GTATTGTGGG GGAGTTCAGG TGTTTGCGCG
CAGTACATCA TGGAGCAAAG CCGTATGTCG TCACAGTTCC TGACGATGAT ACGTTTGTTA
TTCGCCGGGC TGATACTGGT GACCTTCTCC TTTATGCACG GCGATAAGAT ATTTTCGATT
CTTAAAAACC GCAAAGATGC CCTGAGTCTG CTGATTTTCT CCGTGGTGGG CGCGCTCACC
GTTCAGCTAA CCTTCCTGCT TACGATTGAA AAATCCAATG CCGCCACCGC GACAGTGCTG
CAATTTTTAT CGCCGACCAT TATTGTAGCG TGGTTTGCAT TAGCGCGAAG AACACGACCA
GGCATTCTGG TCTTAACCGC CATTCTTACA TCGCTTATCG GCACCTTTTT ACTGGTGACT
CACGGCAATC CAACATCGCT GTCGATCTCT TCAGGCGCGC TGTTCTGGGG TATCGCCTCC
GCATTTGCCG CCGCCTTTTA TACGACCTGG CCTTCCAGGC TAATCGCCCA ATACGGCACG
CTGCCAGTGG TCGGCTGGAG TATGTCCTTT GGCGGCCTTA TTCTGCTGCC CTTCTACGCT
AAAGAAGGAA CGCACTTTGC GGTGAGCGGC AGCCTGATTC TGGCCTTTTT CTACCTTGTG
GTGATCGGTA CGTCGCTGAC GTTCAGCCTG TATTTGAAAG GCGCGCAACT GATTGGTGGC
CCCAAAGCCA GCATTTTAAG CTGCGCGGAA CCGTTAAGCA GCGCCCTGCT GTCGCTACTG
CTGTTGGGGA TTAGTTTTAC CTTGCCGGAC TGGCTGGGCA CGCTGCTCAT TCTCTCGTCA
GTGATTCTGA TCTCCCTCGA TTCCCGTCGA CGCGCGCGGG CCGCTTAA
 
Protein sequence
MMPARKKIKQ HDEEKMGSTR KGMLNVLIAA VLWGSSGVCA QYIMEQSRMS SQFLTMIRLL 
FAGLILVTFS FMHGDKIFSI LKNRKDALSL LIFSVVGALT VQLTFLLTIE KSNAATATVL
QFLSPTIIVA WFALARRTRP GILVLTAILT SLIGTFLLVT HGNPTSLSIS SGALFWGIAS
AFAAAFYTTW PSRLIAQYGT LPVVGWSMSF GGLILLPFYA KEGTHFAVSG SLILAFFYLV
VIGTSLTFSL YLKGAQLIGG PKASILSCAE PLSSALLSLL LLGISFTLPD WLGTLLILSS
VILISLDSRR RARAA
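
The relationship between the two sequences above can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record: the helper names (`clean`, `translate`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. Only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order (first base varies slowest).
# Translation table 11 uses the same codon-to-amino-acid assignments as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGATGCCTG CACGTAAAAA AATAAAACAA CATGACGAGG AAAAAATGGG TTCCACCAGA"
print(translate(first_line))  # prints: MMPARKKIKQHDEEKMGSTR
```

The output matches the first 20 residues of the protein sequence listed above.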