Gene BAS3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3345 
Symbol 
ID2850912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3315927 
End bp3317231 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content36% 
IMG OID637506589 
Productinosine-uridine preferring nucleoside hydrolase family protein 
Protein accessionYP_029602 
Protein GI49186350 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.117595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAT TTATTTTCTT TCGGTGGATT AGAAGCAGCA TTCATTTGGC CGATTGTATT 
AGGGTTATAT TGGAGGAAAG GAAATGCAAC AGGAGCGCTT GCCTCCATTT TAGTTGGAGT
AAGCTCATAT ATGTGCATTC ATCTTTTCTA TCCAAATCCG TTCGGTATAC ATACAGTTGT
CTTCCCGATT TGTTTTGCGT TTATCGCTTA TATCTTAGGA AGCATGGTTG CTGTGAAGAA
AAACAGTATA GGTTAGATGA AAAACGTAGT CCTTACAGGA CTGCGTTTTT TCGTATAAAT
TTGTTCTTTT TATTCTGTAA TCCTGTGAAT ATTCATAAAT TAATAACAAT TGAGGGGATG
GAGTTGAGGG CGATGAAAAA AGTATTATTT TTAGGAGACC CAGGAATTGA TGACTCTTTA
GCAATTATGT ATGGATTGTT GCATCCTGAT ATTGATATTG TTGGTGTAGT AACTGGATAT
GGAAATGTAA CGCAAGAAAA GGCGACAAGT AATGCGGCAT ATTTATTGCA ACTGGCAGGA
CGGGAAGATA TACCTATTAT TAATGGTGCG AAAATCCCTT TATCTGGAGA TATTACAACG
TATTATCCAG AAATTCATGG GGCGGAAGGC TTAGGACCAA TTCGACCGCC GAAAAATCTT
TCTCCAAATA TAAGGCCTTT TTGTGAGTTT TTTGACATTC TTGAAAAATA TAAAGGAGAA
TTAATTATAG TTGATGCTGG GAGGTCAACG ACACTTGCAA CAGCATTTAT TTTAGAAAAA
CCATTGATGA AGTATGTGAA AGAATATTAT ATAATGGGCG GTGCTTTTTT AATGCCTGGA
AATGTTACAC CAGTCGCAGA AGCGAATTTT CATGGTGACC CTATTGCATC ACAATTAGTC
ATGCAAAATG CCAAGAATGT GACGTTGGTG CCGCTGAATG TTACATCTGA AGCTATAATC
ACGCCAGAGA TGGTAAAGTA CATTACGAAA CATTCTAAAA CGAGTTTTAA TAAATTAATT
GAACCGATTT TTACGTATTA TTATAAAGCT TATAGAAAGT TAAATCCGAA AATAACAGGA
AGTCCAGTAC ATGACGTTGT TACAATGATG GTCGCGGCGA ATCCTTCAAT ACTGGATTAT
GTGTATCGTC GTGTAGATGT AGATACAGTG GGGATTGCAA AAGGAGAAAG TATTGCAGAT
TTCCGTCCTC AACCTGATGC AAAAGCCTTA AAAAATTGGG TACGAATTGG TTGGTCATTA
CATTATAAAA AATTCCTTGA GGATTTTGTG AAAATCATGA CGTAG
 
Protein sequence
MVKFIFFRWI RSSIHLADCI RVILEERKCN RSACLHFSWS KLIYVHSSFL SKSVRYTYSC 
LPDLFCVYRL YLRKHGCCEE KQYRLDEKRS PYRTAFFRIN LFFLFCNPVN IHKLITIEGM
ELRAMKKVLF LGDPGIDDSL AIMYGLLHPD IDIVGVVTGY GNVTQEKATS NAAYLLQLAG
REDIPIINGA KIPLSGDITT YYPEIHGAEG LGPIRPPKNL SPNIRPFCEF FDILEKYKGE
LIIVDAGRST TLATAFILEK PLMKYVKEYY IMGGAFLMPG NVTPVAEANF HGDPIASQLV
MQNAKNVTLV PLNVTSEAII TPEMVKYITK HSKTSFNKLI EPIFTYYYKA YRKLNPKITG
SPVHDVVTMM VAANPSILDY VYRRVDVDTV GIAKGESIAD FRPQPDAKAL KNWVRIGWSL
HYKKFLEDFV KIMT