Gene YpAngola_A1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1449 
Symbol 
ID5799918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1503175 
End bp1504701 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content47% 
IMG OID641339405 
Productputative transport protein 
Protein accessionYP_001605966 
Protein GI162420175 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0118167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.558521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAC AGATTTCAAC AAGGCATACG TTGGCTTACG GGAGTGCTAA CTTGCTGGGC 
AGTGGGGCAC TCGCCATTAG TGGTGCCTGG CTGATGTACT TCTACACAAC GTTTTGCGGA
CTTTCTGTTG TAGAAGCAGC CACTATCTTT TCAATCGCCA GTATTATTGA TGCCATCAGT
AACCCGGTAA TGGGTTATAT CACGGATAAC TTCTACAATA CCCGTTTGGG CCGTATCTTC
GGGCGTCGTC GCTTCTTTAT CTTACTGGGT ATCCCACTCG TGTTGGTCTA TCCAATGCTG
TGGATGAGTG GTTTTGGTTT CTGGTATTAC CTACTGACCT ATGCGCTGTT TGAATTAATT
TACACCTCCA TCATGGTGCC TTATGAAACA CTGGCTACCG AGATGACAAC CGATTTTGCC
AAACGCTCTA AACTGACAGG TTCTAAAGCA ATCTTCGGCA AAGTAGCGAA CTTCCTGGCC
GCTTTCATTC CAGGTCAATT CATTGCAATT TATGGCAAAG ATTCCGCAAC CCCCTTCTTA
TATACCGGTA TCGCTTATGG TCTGATTATG TGCTGCGCCA TGATTTGGCT TTACAGCTCA
TCATGGGAGC GTCCAGCCAG TGAAGTGGTC AGGGAAACCA CCAGTAGTTT AGGTCAGGCA
CTGAAAAAGC TGTGTGTTGA TATGGCATCG ACCTTCCATC TGCGCATTTT CCGCAAGCAT
CTGGGGATGT ATCTGTTCGG TTTTGGTGCT GAATGGTTGT TCGCATCCGC ATTTACCTAC
TTTATTATTT TTGGCTTGGG ACAAGATGCC GCATTGGTTT CGCAGCTCAA TAGCTTCAGT
TCGATTATGC AGCTCATCTC CACCGCTATT TTTATTGGCA TCTGCGTAAA AATGGGTTTC
GCTCGTCCCT TCCGTATTGC TCTGCAAGTG GTGATTGTCA GCGTGGTTGC CTACGCAGCA
CTGTATTTCA CCGGTTGGTC TGAAACCACG ACCGTTATCG TCTTATTCTG CATTACCGCT
GTTTTCGGTT TAAGTACTGG TGGGATCTAC TATATTCCAT GGACAGTTTA TACCTTCCTG
GCCGATGTAG ACGAAGTCTT GACCGGGCGT CGCCGTGAAG GAATTTATGC TGGCGCGATG
ACATTCGCCG GGAAAATGGT ACGTTCGATT ATTGTCTTTG CCATGGGCTG GACGTTAAGC
CGCTTTGGTT TTGTTTCTGG TCAATCCAGT CAGCCAGAAA TTGCTGTGCA AGCCATCGTG
GGTGTCTTTG CCATAGGGGT TATCTCACTG GCATTAGTTG CTATTTACTA CACCACACAG
ATGAAACTGG ATCGCAAAAA TCACAGCATT TTACTGGAAG AAATTGAGCG TATTAAAAAT
GGCGGTGCCA TGGCAGATAT CCCCGCTCAT GCCAGAGCCG TCGCGGAAGA ACTGACGGGG
TGGAAATATG AACAATGTTG GGGCAATAAC CCCCTCGGCG TCAAAGAGTC GCCAACCGTC
ATACCCAAAC CGGTTACTGA AAGTTAA
 
Protein sequence
MSKQISTRHT LAYGSANLLG SGALAISGAW LMYFYTTFCG LSVVEAATIF SIASIIDAIS 
NPVMGYITDN FYNTRLGRIF GRRRFFILLG IPLVLVYPML WMSGFGFWYY LLTYALFELI
YTSIMVPYET LATEMTTDFA KRSKLTGSKA IFGKVANFLA AFIPGQFIAI YGKDSATPFL
YTGIAYGLIM CCAMIWLYSS SWERPASEVV RETTSSLGQA LKKLCVDMAS TFHLRIFRKH
LGMYLFGFGA EWLFASAFTY FIIFGLGQDA ALVSQLNSFS SIMQLISTAI FIGICVKMGF
ARPFRIALQV VIVSVVAYAA LYFTGWSETT TVIVLFCITA VFGLSTGGIY YIPWTVYTFL
ADVDEVLTGR RREGIYAGAM TFAGKMVRSI IVFAMGWTLS RFGFVSGQSS QPEIAVQAIV
GVFAIGVISL ALVAIYYTTQ MKLDRKNHSI LLEEIERIKN GGAMADIPAH ARAVAEELTG
WKYEQCWGNN PLGVKESPTV IPKPVTES