Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4604 |
Symbol | |
ID | 5149412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4823369 |
End bp | 4824589 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640559404 |
Product | hypothetical protein |
Protein accession | YP_001240538 |
Protein GI | 148255953 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATGG CCGCCGTGAT CGAAAACCGG ACGGCGCAGT CGCCGCTGTC TGGACTTGCA CAGGTCGAGA TCGTCTCTGA TCTCGCCGCG GCCGAACCGG CCTGGCGCAT TCTCGAGGCA CCCGACCACA TCTCGACGCC CTACCAGCGC TTCGACCTGC TCGCCGCGTG GCAGCACGAG GTCGGCGCCC GCGAGCAGGC GACGCCCTTC ATCGTCATCG CTCGCGATGC CAAGCAGCAG CCGTTGCTGC TGCTGCCGCT GGCGCTGACG CACGCGTTCG GGGCGCGGGT CGCGAGCTTC ATGGGCGGCA AGCACACCAC CTTCAACATG CCGCTGATGC ACCGCGCGTT TGCGGCGCGT GCCAGCGTTG GCGATCTCGA ATTCCTGCTT GCCGGACTGC GTGATCATGG CGGCGTGGAC GTGCTGGCGT TGACGCAACA GCCGCTGCGC TGGAGCACGA TCGCCAACCC GCTGGCGCAA TGGCCGCGCC AGCCATCCGT GAACGACTGC CCGGTGCTGT TGATGCCCCC CGGCGCCGCA TCGACCGCTT TGTTGTCGAA CTCATTCCGC AAGCGGCTCA AGAGCAAGGA GAAGAAGCTG CAGGCGCTGC CCGGCTATCG CTACATGATC GCCAGCAGCG ATGCCGAGAT CGCCGAGCTG CTGGACTGGT TCTTCCGGAT CAAGCCGATC CGCATGGCCG AGCAGAAGCT GCCCAACGTC TTCGCCGAAC CCGGCATCGA AGCCTTCGTG CGCGCCGCAT GCCTGGCCAA GCTCAGCTGC GGTCATCGCG CCATCGAGAT TCATGCGCTG CGCTGCGACG ACGAGATCAT CGCGCTGTTC GCCGGCGTCG CCGATGGCGA GCGCTTCTCG ATGATGTTCA ACACCTATAC GCTGTCGGAG AATGCCCGCT GGAGTCCGGG ACTGATCCTG ATGCGCTCGA TCATCGATCA TTATGCGCAA AGCGGGTTCC GCGCGCTCGA TCTCGGCATT GGCTCCGACG ACTACAAGCG GATGTTCTGC AAGGATGACG AGCCGATCTT CGACAGCTAT CTGTCCTTGA CTGCGCGCGG GCTCGTGGCG GCGCGGACGA TGGCCGCGCT TGGCCGCGCC AAGCATGCCG TCAAGCACAG CCCTGCTCTG TTTCGTCTGG CGCAGCGCGT CCGCGGCGCG CTGCAGTCTG GCGGCAGCGC CGCCAGAGCC GAGGAACGGG CCGACGATTA G
|
Protein sequence | MTMAAVIENR TAQSPLSGLA QVEIVSDLAA AEPAWRILEA PDHISTPYQR FDLLAAWQHE VGAREQATPF IVIARDAKQQ PLLLLPLALT HAFGARVASF MGGKHTTFNM PLMHRAFAAR ASVGDLEFLL AGLRDHGGVD VLALTQQPLR WSTIANPLAQ WPRQPSVNDC PVLLMPPGAA STALLSNSFR KRLKSKEKKL QALPGYRYMI ASSDAEIAEL LDWFFRIKPI RMAEQKLPNV FAEPGIEAFV RAACLAKLSC GHRAIEIHAL RCDDEIIALF AGVADGERFS MMFNTYTLSE NARWSPGLIL MRSIIDHYAQ SGFRALDLGI GSDDYKRMFC KDDEPIFDSY LSLTARGLVA ARTMAALGRA KHAVKHSPAL FRLAQRVRGA LQSGGSAARA EERADD
|
| |