Gene Information |
Locus tag | BBta_4221 |
Symbol | |
ID | 5149043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 4434311 |
End bp | 4435261 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640559046 |
Product | hypothetical protein |
Protein accession | YP_001240183 |
Protein GI | 148255598 |
COG category | [S] Function unknown |
COG ID | [COG4765] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGCCG CAACGTCCCT CGCGCTGGCC GTGCCTGCAC GCGCGCAGAT CGGGACTATT TTTTCAGATC CCGTTCCTCG GCCGCCCGGC AACATTCCGC GTCGCGGAGA GCCGATTCCG CCGCCCGAGG AAGAGGAGGA AGTTCCGGAA CTGCCGCAGG GCCGCGTTCT GCCGGCACCG ACCCGTCCAG CGCCTGGGCC GGGGGCTGCC CTGCCGGGCC CGGTGCAGTC GCAGCCGCTG GCGCCGCCTC CGGGGACGGC CGTGCCCCCG ACCAATGCTC CCGTCGCAGT CGCTCCGCCT CCTGGACAGC CCGGCGCGGC GCCGCCGGCC GGCGGCCAGC GGCAGCCGCC GCGCGGTGCG CCCCAGAATG CAGCGGTTCC GCCGAATGGC GCCGTGCCGC AGACGCCGGC GACCCTGCAG CCGGGCGACG AGGTCGTGAC CGAGCCGCCG GCGCAGAAGA TCGTGAATAA GAAGGCGAGC TTCTCGGGCC TGGACAAGAT CACCGGGCGC ATCATCAATT TCGACGAGGA TATCGGCGAG ACGGTGCAGT TCGGCGCATT ACGGGTGAAG ACCGACGCCT GCTACACGCG TCCGGCCACT GAGGCCGCGA ACACCGACGC CTTCGTGCAG GTCGATGAGA TCACCCTGCA GGGTGAGGTG AAACGCATCT TCTCGGGATG GATGTTCGCC GCGAGCCCGG GCCTGCACGG TGTCGAACAT CCCATTTACG ACATCTGGCT GGTCGACTGC AAAGAGCCGC AGACGACCGT CGTGAGCACC GCACCCGACC AGAAGCCGGC GGCGCAGCAG CCGGCCCAGA AGCGGCCGCC GCAGCAGCGC CAAGCTGCCC CGCGGCCGCA GGCACCGCCG CAGCAGTATC AGACGCAGCA GATGCCGCCT CCGCCCCCGC CGCCTCAGGC GGGCGGACCG TTCGGCGGTG TGTTCAGATA G
Protein sequence | MLAATSLALA VPARAQIGTI FSDPVPRPPG NIPRRGEPIP PPEEEEEVPE LPQGRVLPAP TRPAPGPGAA LPGPVQSQPL APPPGTAVPP TNAPVAVAPP PGQPGAAPPA GGQRQPPRGA PQNAAVPPNG AVPQTPATLQ PGDEVVTEPP AQKIVNKKAS FSGLDKITGR IINFDEDIGE TVQFGALRVK TDACYTRPAT EAANTDAFVQ VDEITLQGEV KRIFSGWMFA ASPGLHGVEH PIYDIWLVDC KEPQTTVVST APDQKPAAQQ PAQKRPPQQR QAAPRPQAPP QQYQTQQMPP PPPPPQAGGP FGGVFR