Gene YpAngola_A2454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2454 
Symbol 
ID5800924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2573026 
End bp2574723 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content52% 
IMG OID641340329 
Productputative sulfate transporter YchM 
Protein accessionYP_001606872 
Protein GI162420678 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACGC ACGGAATTAA CGGCATTAGG CCGTTCAGCG CAATTATTGA CGCTTGCTGG 
CGAGAAACTT ATACACTTCA ACGACTTTTC AAAGACATAA TTGCCGGTGT CACCGTCGGT
ATTATTGCCA TCCCGTTGGC TATGGCATTG GCGATTGCCA GTGGTGTGCC ACCGCAATAC
GGCTTGTATA CTTCGGCTAT CGCCGGGATC GTGATCGCGG TCAGCGGTGG TTCCTGCTAC
AGCGTATCCG GCCCTACCGC AGCCTTTGTG GTCATTTTGT ATCCGGTGTC ACAACAGTTT
GGTCTAGCCG GCTTACTCTT GGCGACAATA TTATCCGGTC TATTTCTGCT CTGTATGGGG
TTTGCCCGGC TTGGGCGGTT GATCGAATAT ATACCGTTAT CAGTGACCCT CGGCTTCACT
TCAGGAATTG CCATCACCAT TGCAACCATG CAGGTGAAAG ATTTCTTCGG GTTACACCTG
GTCGCCGTGC CAGAAAATTA TGTTGGAAAA GTCACCGCCC TCGCACAGGC CATGCCAACG
ATTAATGTGA GTGATACGTT AATTGCGACG GTTACCTTGC TGGTATTAAT CCTTTGGCCA
CGGCTGAAAC TAAAGTTACC GGGTCACTTA CCTGCATTAG TCGCGGGTAC TGCAGTCATG
GGGGTGCTCT CACTGTTTGA TCAACAGGTT GCGACTATCG GCTCACGCTT TGGTTATCTC
TTGGCTGATG GGACTCAGGG CCAAGGTATT CCGCCTATTC TGCCGCAGTT TGTGCTGCCC
TGGCAGTTAC CTGCCGCAAA TGGTCAAGAG TTTGTGCTGA ACTGGGCGAC ACTTTCTGCA
CTGCTGCCCG CCGCCTTTTC AATGGCGATG TTAGGGGCGA TAGAGTCCTT GCTGTGCGCA
GTCGTCCTTG ATGGTATGAC TGGGCAGAAG CATCATTCAA ACAGTGAATT GGTCGGTCAA
GGGCTAGGCA ACATTGTCGC CCCCTTCTTT GGTGGGATTA CCGCAACGGC AGCCATCGCC
CGTTCTGCTG CTAACGTCCG CGCGGGTGCC ACCTCGCCGG TATCCGCGAT TATCCACGCG
GTGCTGGTGC TACTGGCGTT ACTGGTACTG GCACCGATGC TGTCGTACCT GCCATTAGCC
GCAATGGCTT CCCTGCTGTT GATTGTCGCC TGGAACATGA GTGAAGCCCA TAAGGTTATT
GACCTGCTGC GCCGTGCGCC TAAGGACGAC ATCATCATTA TGCTGCTGTG TATGAGCCTC
ACCGTATTGT TCGACATGGT GATCGCGATA ACGGTTGGGA TTGTGTTGGC ATCATTGTTG
TTTATGCGAC GTATTGCGCA AATGACACGC CTGAGTGAGA TACCTGCGGC GGTTAATGAT
AAGACATTGG TATTGCGAGT AAACGGCCCG TTGTTCTTCG CTGCCGCTGA ACGGATATTC
AGTGAGTTGC TCACCCGTTG CGATAATTAT GACACTATTA TCCTGCAATG GGATGCTGTG
CCCGTTCTCG ATGCCGGTGG CTTGAATGCT TTCCTGCGCT TTACTGAAGC GTTGACAGAG
CAGCAACTGC TGGTTATCAC AGATATTCCT TTCCAGCCAC TGAAGACGCT GGCCAGAGCC
AGGGTGAAGC CAATTTCAGG CAAGTTGAAT TTCTATGCTT CTTTACCGGA AGCACTGGCG
GCACTACAGA ATAACTAG
 
Protein sequence
MKTHGINGIR PFSAIIDACW RETYTLQRLF KDIIAGVTVG IIAIPLAMAL AIASGVPPQY 
GLYTSAIAGI VIAVSGGSCY SVSGPTAAFV VILYPVSQQF GLAGLLLATI LSGLFLLCMG
FARLGRLIEY IPLSVTLGFT SGIAITIATM QVKDFFGLHL VAVPENYVGK VTALAQAMPT
INVSDTLIAT VTLLVLILWP RLKLKLPGHL PALVAGTAVM GVLSLFDQQV ATIGSRFGYL
LADGTQGQGI PPILPQFVLP WQLPAANGQE FVLNWATLSA LLPAAFSMAM LGAIESLLCA
VVLDGMTGQK HHSNSELVGQ GLGNIVAPFF GGITATAAIA RSAANVRAGA TSPVSAIIHA
VLVLLALLVL APMLSYLPLA AMASLLLIVA WNMSEAHKVI DLLRRAPKDD IIIMLLCMSL
TVLFDMVIAI TVGIVLASLL FMRRIAQMTR LSEIPAAVND KTLVLRVNGP LFFAAAERIF
SELLTRCDNY DTIILQWDAV PVLDAGGLNA FLRFTEALTE QQLLVITDIP FQPLKTLARA
RVKPISGKLN FYASLPEALA ALQNN