Gene YpAngola_A3960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3960 
Symbol 
ID5802438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4212391 
End bp4213722 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content46% 
IMG OID641341746 
ProductCBS/transporter associated domain-containing protein 
Protein accessionYP_001608256 
Protein GI162419427 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.216105 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAATA GTATCTTACT GATTCTTTTT TTAATTGCGG TCAGCGCCTT CTTCTCGCTA 
TCAGAGATTT CATTGGCGGC TTCACGCAAA ATTAAACTAA AACTGCTGGC GGACGAGGGC
GATACCAACG CCTTACGAGT CCTGAAACTG CAAGAGACGC CAGGAATGTT CTTCACCGTG
GTCCAAATTG GCCTGAATGC TGTCGCCATT CTTGGTGGTA TTGTCGGTGA TGCCGCTTTC
TCCCCTTCGT TCAAACTCGT TTTTGAGCGT TTTATGGCTC CTGAGTTGGC CGATCAAGCC
TGTTTCGTTT GTTCTTTCGT GTTAGTGACC AGCTTATTTA TTCTGTTTGC TGATTTAACC
CCGAAACGCA TCGGTATGAT TTCACCTGAA GCGGTTGCCG TCCGGATCGT CAACCCAATG
CGCTTCTGCC TAATGATCTT CCGCCCATTA GTCTGGTTCT TCAATGGGAT GGCAAATCTT
ATCTTCCGCC TATTTAAATT ACCCATGGTC CGTAACGATG ACATCACTTC CGATGATATC
TATGCCGTGG TAGAAGCCGG TGCGCTCGCC GGAGTGCTAC GCAAGCAAGA GCATGAGTTG
ATTGAAAACG TCTTTGAGCT GGAGTCTCGA ACCGTTCCTT CCTCCATGAC TTCACGTGAA
AACGTGATTT ACTTTGATCT ACGGGAAAGC GAAGACAGTA TCAAAGATAA AATCTCCACA
CATCCGCACT CAAAATTCCT GGTATGTGAT GGCCACATTG ACCAAGTGGT GGGTTACGTT
GACTCTAAAG ACTTGCTGAA TCGGGTATTA GGTAACCAAA GTCTGGTACT CAGCAGTGGC
GTACAAATTC GTTCAGCTCT GATTGTGCCA GATACATTGA CACTTTCAGA AGCGTTGGAG
AGTTTTAAAA CCGCGGGTGA AGACTTCGCC GTGATCCTCA ACGAATATGC TTTAGTTGTT
GGGATAATTA CACTGAATGA CGTAATGACC ACGTTGATGG GCGATTTAGT TGGCCAAGGG
CAGGAAGAGC AAATTGTTGC CCGCGATGAG AATTCATGGC TGATTGAGGG CGGTACACCG
ATTGAAGATG TCATGCGCGT ACTGCATATC GACGATTTCC CGCAATCGGG CAATTATGAA
ACTATCGGCG GCTTTATGAT GTATATGCTG CGTAAAATTC CTAAACGAAC TGATTTTGTT
AAATATGCGG GTTACAAATT TGAAGTCGTC GATATTGATA GCTACAAGAT AGATCAGCTA
CTGGTGACAA GGCTCAGTGA CCAGCCAGCG CCAATCCTGC CAAAAGCACC ACACGAAAGC
AGTGACGCCT AG
 
Protein sequence
MLNSILLILF LIAVSAFFSL SEISLAASRK IKLKLLADEG DTNALRVLKL QETPGMFFTV 
VQIGLNAVAI LGGIVGDAAF SPSFKLVFER FMAPELADQA CFVCSFVLVT SLFILFADLT
PKRIGMISPE AVAVRIVNPM RFCLMIFRPL VWFFNGMANL IFRLFKLPMV RNDDITSDDI
YAVVEAGALA GVLRKQEHEL IENVFELESR TVPSSMTSRE NVIYFDLRES EDSIKDKIST
HPHSKFLVCD GHIDQVVGYV DSKDLLNRVL GNQSLVLSSG VQIRSALIVP DTLTLSEALE
SFKTAGEDFA VILNEYALVV GIITLNDVMT TLMGDLVGQG QEEQIVARDE NSWLIEGGTP
IEDVMRVLHI DDFPQSGNYE TIGGFMMYML RKIPKRTDFV KYAGYKFEVV DIDSYKIDQL
LVTRLSDQPA PILPKAPHES SDA