Gene Information |
Locus tag | Bind_3331 |
Symbol | |
ID | 6201546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 3788582 |
End bp | 3789388 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641707279 |
Product | pyrimidine 5'-nucleotidase |
Protein accession | YP_001834379 |
Protein GI | 182680233 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01993] pyrimidine 5'-nucleotidase |
|
|
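The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 3788582
end_bp = 3789388
reported_gene_length_bp = 807
reported_protein_length_aa = 268

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon, so it adds no residue.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp == reported_gene_length_bp)
print(expected_protein_length == reported_protein_length_aa)
```

Under these assumptions both comparisons hold: 807 bp gives 269 codons, one of which is the stop codon.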
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.142582 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCAAA CGACGAAAAA TTCTCCTCCC TCTCCGCTGG GTTCCCTTGC GCGAAAAACT CCGCCGTTTG CGGCGGAGAA TCAGGCTTCC TTGCCGTCGC ACCAGAATCG GGTCCTGGCC CATGTGGAGA CTTTCGTCTT CGATCTCGAT AATACGCTCT ACCCGTCCCA TTGCGATCTC TGGCCCAAGA TCGATGCACG CATCACCCTT TATATGATGC ATCACCTGGG GCTCGACGGC CTGTCCTCCC GCGCCTTGCA GAAACATTAT TATCACCATT ACGGCACGAC CTTGCGCGGA CTGATGCAGG AAGATGCAGT CGGTGCAGAA GACTTTCTAG CTTTCGTCCA TGACATAGAC CGCAGCTCGC TGCCGCCCAA TCCGACACTC GCCGACGCCA TTACCCGTTT GCCGGGCCGC AAGCTGATTC TGACCAATGG CTCGCGCGAT CATGCGCTCA ATACGGCCAA GGCCCTCGGA CTCGAGGCCT TGTTCGAGGA TGTTTTCGAT ATTGCCGACG CCGACTTCGT CCCCAAGCCG CATCCCACGG CCTATGAACG GTTCTTCGAC AAGCATGCCG TCGATCCAGC GCGTGCCGTC ATGTTTGAGG ATCTGACGAA AAACCTGCTC ATTCCGCATC AGCGCGGCAT GAAGACCGTG CTCGTCGTGC CGAAGCCCGG CCAATTGGAC CATCGCGACA AGATCGAGAT CGCCGGTCGC GAGATCCCGC CGCATATCGA CTATGTCACC GATGACCTCG AAAGCTTTCT GCTCGGGCTT CTTGAGGACG CCACGAACAA GCCGTGA
|
Protein sequence | MTQTTKNSPP SPLGSLARKT PPFAAENQAS LPSHQNRVLA HVETFVFDLD NTLYPSHCDL WPKIDARITL YMMHHLGLDG LSSRALQKHY YHHYGTTLRG LMQEDAVGAE DFLAFVHDID RSSLPPNPTL ADAITRLPGR KLILTNGSRD HALNTAKALG LEALFEDVFD IADADFVPKP HPTAYERFFD KHAVDPARAV MFEDLTKNLL IPHQRGMKTV LVVPKPGQLD HRDKIEIAGR EIPPHIDYVT DDLESFLLGL LEDATNKP
|
| |
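The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the accepted alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline; the function name `translate_cds` is hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# Codon table in the conventional TCAG ordering (amino acids shared by
# translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons accepted by translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in TABLE_11_STARTS:
            protein.append("M")  # initiator codon is always methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon: not part of the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
prefix = "GTGACGCAAA CGACGAAAAA TTCTCCTCCC"
print(translate_cds(prefix))  # MTQTTKNSPP
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence with the spaces left in would work the same way, since the function strips them before reading codons.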