Gene Sama_2097 details


Gene Information

Locus tag: Sama_2097
Symbol:
ID: 4604347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 2541383
End bp: 2542420
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 56%
IMG OID: 639781482
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_927972
Protein GI: 119775232
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
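
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2541383, 2542420)
print(length)                     # 1038
print(protein_length_aa(length))  # 345
```

Both results agree with the Gene Length (1038 bp) and Protein Length (345 aa) values listed above.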


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.385
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACTC CTACCCCACT GAGCTATAAA GACGCCGGTG TCGATATCGA TGCAGGCAAC 
GCACTGGTGC AAAACATCAA GTCTGCCGTT AAGCGCACCC GCCGCCCTGA AGTGATGGGC
AACCTGGGTG GTTTTGGTGC CCTGTGTGAA CTGCCGACCA AATACAAGCA CCCAGTGCTG
GTATCCGGTA CCGACGGCGT GGGAACCAAG CTGCGTCTGG CCATTGACTT CAAGAGCCAC
GACACCGTGG GTATTGATCT GGTCGCCATG TGTGTGAACG ACCTGATTGT GCAGGGCGCT
GAGCCACTGT TCTTCCTCGA CTACTATGCC ACCGGCAAGC TGGACGTAGA GACAGCCACC
TCTGTAGTAA ATGGTATTGG TGAAGGCTGT TTCCAGTCAG GTTGCGCCCT GATTGGTGGT
GAAACCGCCG AAATGCCCGG CATGTACGAA GGCGAAGACT ACGACCTGGC CGGTTTCTGC
GTAGGTGTGG TTGAAAAGGC CGACATCATC GACGGCACCA AGGTAAAAGC CGGTGATGCG
CTGATTGCAC TCGCCTCAAG TGGTCCTCAC TCAAACGGTT ATTCTCTTAT CCGTAAGGTA
CTGGAAGTGA GCAAGGCCGA TCCTCAAATG GATCTGAACG GCAAGCCGCT CATCAAGCAC
CTGCTGGAAC CCACCAAGAT TTATGTCAAA TCACTGCTGA AGCTGATTGC CGAAAGCGAC
GTACACGCCA TGGCGCACAT CACCGGCGGT GGTTTCTGGG AAAACATCCC ACGCGTACTG
CCTGACAACT GCAAAGCCGT GGTTCAGGGC GATTCCTGGC AGTGGCCTGT GGTATTCGAC
TGGCTGCAAA CTGCCGGCAA TATCGAAACC TACGAAATGT ACCGCACCTT TAACTGCGGC
GTGGGCATGA TTGTTGCCCT GCCCGCCGAC AAGGTTGACG CGGCCCTTGA ACTCCTTAAG
GCCGAAGGTG AAAACGCCTG GCACATCGGC CATATCGCCG CGCGTAATGG CGATGAAGAG
CAGGTGGAGA TCCTCTGA
 
Protein sequence
MSTPTPLSYK DAGVDIDAGN ALVQNIKSAV KRTRRPEVMG NLGGFGALCE LPTKYKHPVL 
VSGTDGVGTK LRLAIDFKSH DTVGIDLVAM CVNDLIVQGA EPLFFLDYYA TGKLDVETAT
SVVNGIGEGC FQSGCALIGG ETAEMPGMYE GEDYDLAGFC VGVVEKADII DGTKVKAGDA
LIALASSGPH SNGYSLIRKV LEVSKADPQM DLNGKPLIKH LLEPTKIYVK SLLKLIAESD
VHAMAHITGG GFWENIPRVL PDNCKAVVQG DSWQWPVVFD WLQTAGNIET YEMYRTFNCG
VGMIVALPAD KVDAALELLK AEGENAWHIG HIAARNGDEE QVEIL
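
The protein sequence begins with M even though the gene sequence begins with GTG, which normally encodes valine. That is expected under translation table 11, where GTG is one of several alternative initiation codons and an initiator codon is read as methionine. The following is a minimal sketch of that translation, not the database's own method: the codon table is built from the standard amino acid assignments shared by tables 1 and 11, the start-codon set is the one NCBI lists for table 11, and the helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, in TCAG order
# (the amino acid assignments are identical to the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCACTC CTACCCCACT GAGCTATAAA"))  # MSTPTPLSYK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence in the same way should give a string that can be compared against the listed protein.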