Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_2026 |
Symbol | |
ID | 7086860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 2394619 |
End bp | 2395530 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643460929 |
Product | UBA/THIF-type NAD/FAD binding protein |
Protein accession | YP_002357953 |
Protein GI | 217973202 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00215309 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAAA ACCCACCGCT TGCTCACGCA GAGGTTCTTT GCGGTCATAT ACACAGCGAC AATGAACACT GCGGCCATGC ACTCCGTGAT GCCGATTTCA TCCGTTATTC TCGGCAAGTC TTATTGCCAG AAGTGGGTGA GGCTGGGCAA TTACAACTGG CCGACGCTGG TGTCGCTATT ATCGGTCTTG GCGGTTTAGG GCAATTGGCG GCGCAATATC TGGCCTGTGC AGGTATTGGT CGCTTGACCT TAATCGATAT GGATAAGGTT GAAGTATCAA ATCTGCCAAG ACAATTGTTA TTCAACGATG CCGATATTGG ACTGAATAAG GCGCGAGTTG CCAAGCAAAA GCTCAATGGC ATAGCACCGC ATTGCACTGT TACTGCCTAT GAAACAGTAT TTAGTGCTGC GACATCAGCC CATCACTTCG CTGAGGTTTT ACAATTAAAA CAACAAGGTA AAAGAGTACT TGTGCTTGAC TGCACCGATA ACTTCGCGAC TCGCCAAGCC ATTAATCACA GCTGTATCGA AGCGGCTTTG CCTTTAGTCA GTGCGTCAAT CGCAGCATTT AGCGGTCAAT TGTTTGCAGT CGACCAAATG CGGTTCCCCT ATGGTGGTTG TTATCACTGT ATTTTCCCCG CACAAACGAG AGTGTCGCAG AGCTGCAGTA CCCAAGGCGT ACTCGGGCCG AGCGTAGGCG TGATGGCGTC AATGCAATCT TTGGTGGCCA TGCAACTCTT GTTGAATGTG GATAGTTGTG ATGAACCTAA AAGTGCCTTG TTGGGACGTT TTTGGCGCTT CGACGCTAAA TCACTTTCAT GGACAGCGGC GATATTAACG CGGGATCCCC ATTGTGATGT GTGTGGTCCA AAAGAAGTAC GCTCATCATC CGATAAGCCC AGCAAACTAT AA
|
Protein sequence | MSKNPPLAHA EVLCGHIHSD NEHCGHALRD ADFIRYSRQV LLPEVGEAGQ LQLADAGVAI IGLGGLGQLA AQYLACAGIG RLTLIDMDKV EVSNLPRQLL FNDADIGLNK ARVAKQKLNG IAPHCTVTAY ETVFSAATSA HHFAEVLQLK QQGKRVLVLD CTDNFATRQA INHSCIEAAL PLVSASIAAF SGQLFAVDQM RFPYGGCYHC IFPAQTRVSQ SCSTQGVLGP SVGVMASMQS LVAMQLLLNV DSCDEPKSAL LGRFWRFDAK SLSWTAAILT RDPHCDVCGP KEVRSSSDKP SKL
|
| |