Gene Sbal223_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2549 
Symbol 
ID7086115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3034597 
End bp3035634 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content48% 
IMG OID643461442 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002358466 
Protein GI217973715 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000317859 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000231938 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAGCACAC CTACCCCACT GAGCTATAAA GATGCCGGCG TTGATATTGA TGCAGGTAAT 
GCACTGGTAA GTAACATTAA AGCAGCCGTT AAACGTACCC GTCGTCCAGA AGTTATGGGC
AACTTAGGTG GTTTTGGCGC CCTGTGTGAA ATCCCCACTA AATACAAGCA ACCGGTTTTA
GTGTCTGGCA CCGACGGTGT TGGAACTAAA TTGCGTTTAG CCATCGACTA TAAAAAACAC
GACACAGTCG GCATAGACTT AGTTGCTATG TGTGTGAACG ATTTAATCGT TCAAGGCGCT
GAGCCACTGT TTTTCCTCGA TTACTATGCG ACAGGCAAGC TGGATGTTGA AACCGCGACG
TCTGTTGTCA ACGGTATCGG CGAAGGTTGT TTCCAATCTG GTTGTGCGTT AATCGGCGGT
GAAACCGCTG AAATGCCAGG CATGTACGAA GGTGAAGATT ACGACCTAGC AGGTTTCTGC
GTGGGCGTAG TTGAAAAAGC CGACATCATT GACGGTAGCA AAGTTGCAGC GGGTGATGCG
CTTATCGCAT TAGCCTCGAG CGGCCCTCAT TCAAATGGTT ACTCTTTAGT ACGTAAAGTA
TTAGAAGTGA GCCAAGCAGA CCCTCAACAA GATCTCAATG GCAAACCGCT AATTGAGCAT
CTCCTTGAGC CAACCAAAAT TTACGTGAAA TCATTGCTGA AACTGATCGC AGCATCAGAC
GTACATGCAA TGGCACACAT TACTGGCGGC GGCTTCTGGG AAAACATCCC ACGCGTACTA
CCAGATAATT TAAAAGCGGT TATCCAAGGC GATTCATGGC AATGGCCTGC TGTTTTCAGT
TGGTTAATGG AAAATGGCAA CATTGCAGAA TATGAAATGT ATCGCACCTT CAACTGTGGC
GTCGGCATGT TAGTCGCGCT GCCAGCCGAT AAAGTGGATG CAGCACTTGC ATTACTGGCT
GCAGAAGGCG AACAAGCTTG GCTGATCGGT GCTATAGCAG ATCGTGAAGG CAATGAAGAG
CAAGTGGAGA TCCTGTAA
 
Protein sequence
MSTPTPLSYK DAGVDIDAGN ALVSNIKAAV KRTRRPEVMG NLGGFGALCE IPTKYKQPVL 
VSGTDGVGTK LRLAIDYKKH DTVGIDLVAM CVNDLIVQGA EPLFFLDYYA TGKLDVETAT
SVVNGIGEGC FQSGCALIGG ETAEMPGMYE GEDYDLAGFC VGVVEKADII DGSKVAAGDA
LIALASSGPH SNGYSLVRKV LEVSQADPQQ DLNGKPLIEH LLEPTKIYVK SLLKLIAASD
VHAMAHITGG GFWENIPRVL PDNLKAVIQG DSWQWPAVFS WLMENGNIAE YEMYRTFNCG
VGMLVALPAD KVDAALALLA AEGEQAWLIG AIADREGNEE QVEIL