Gene GBAA_4307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4307 
Symbolpyn-2 
ID2820098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3934466 
End bp3935770 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content39% 
IMG OID637791011 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_020952 
Protein GI47529603 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00037416 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCAGA TTTAACGATG
GCAATGGTAA ATAGTGGTGA TACAATCGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT
AAGCACTCAA CAGGTGGCGT TGGTGATATA ACGACACTTG TATTAGGTCC ATTAGTAGCC
GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGCGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGGTTC CATGTTGAAA TCGAAAATGA TGAATTCATG
CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAGA GTGGAAACTT AACACCTGCG
GATAAAAAGT TATATGCACT CCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGCTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT
CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA
CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CACTTGGAAG TCAAATGGTA
TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCGCGTG AGAAATTAAT TGAAGTAATG
AACAACGGTA AAGCGCTAGA ATCATTTAAA ACATTCTTAT CAGCGCAAGG CGGCGATGCA
TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT
CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC
CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG
CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDI TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTE