Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4342 |
Symbol | |
ID | 5152281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4557293 |
End bp | 4561513 |
Gene Length | 4221 bp |
Protein Length | 1406 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640559156 |
Product | TPR repeat-containing protein |
Protein accession | YP_001240293 |
Protein GI | 148255708 |
COG category | [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0457] FOG: TPR repeat [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.603254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAGC CCGCCTTCGC CTTACGTCCT CGGAGTATCA ACGCCTCGAC CGGTCCCGTC GCGCCTTCGG CGACGGCGAT CGATGAACGT GCTTTGCGGA TCACCGAACA GGCCTACCGA AAAGTGCTGG CGCTCCAACC GCACCATTTC CGCACCTTGT GCGGCCTCGC CATGGTGCGG CTGCAGTTGG GCGACGTCCA CGAAGCGCGC ACCCTGCTCG ATCAGGCAGC GCGCGAGGCC GGCGACTCCG CGGAATTTCA TCTGATGCTC GGCAAGGCCT TCGCTGGCCT GGGCGACCTG GCGACCTCGA GCGTCCATTT CCAGCGCGCC GTGGCGCTCG ACGACACGCT GATCCAGGCC CGGATCCTGC TGGGCAGCGC ATTCACCAAC CTCGGAGATC CGGCAGGCGC AGTCCGGCAC CTTGAGCTTG CGCTCGCGGC CGACGCAGAT GATGCCGACG CCCATCAGAC CCTTGGCTTC GCGCTGCAGC GCCTCGGACA ATTCGAGCGC GCGATGTCGC ATCACGAGGC GGCACTCGCG GCCCGGCCGC AATTCGCCGC CGCCGCTGCG AGTCTCGGAG ACGCCTGCCG CCAGCTCGGC CGGCATGCCG AAGCCATCGC CCATTACGAG CGCGCTCTCA CGCTCCAGCC CAATGCCCCA GCCGTCCTGC TCAATATCGG CGGCTGCCAG CAGGCCATCG GCCAGACCGA GGCCGCGGTC CGCACCTATC AACGCGCCCT CGTCCTGAGC CCGCATCTCG CCGAGGCGCA CTACAACCTC GGCAACCTTC ATCTGGAGAT GAACAGCTGG CCGATCGCAG TCTTCCACTA CGAGCGCGCG ATCGCCGAAC GGCCGGATTT TCCGGAGGCC CACAATAATC TCGCCAATGC GCTGCAGTCA CGCGGCCGGC ATGAGGAGGC GCTGGCGCAT TATGACGAGG CGCTGCGCCG CAGGCCGAGC TACGCAATCG CGCACCGCAA CCGCGCTGAT ACTCTGCGCA ACATGAAGCG GTTCGACGAG GCCATCGCCG GTTATCACGA CGCCCTCGCC CTCGAACCCG CCGATACGAC GACCTTGAAT CATCTCGCCG GCGTGCTGAT GATCGTCGGC CGGCTCGACG AGGCCGAACA GGCCTACCGA TCGGCGCTGG CGATCAATCC CCGCAACATC GGCGTCCATC TCAATTACGG CGTCGTCAAG CCGTTCACGG TCGATGATTC GCGTTGGCCG GCGCTGCAGG ATCTGGCGGC GAGCGTAGAG ACGCTGAGCG ACGACGCACG CATCACGCTG CACTTCACGC TTGGCCGCGC CTATGCCGAC GTCAAGGACG GCGAAAAGTC ATTGCGGCAC CTTCAGGCCG GCAATGCGCT CGAGCGACGC CGCATCAGCT ATGACGAGAA CCAGACGCTG CGTCAGATGG AGCGCATCCG CGACGTGTTC TCCCGGGACA TGCTGCAGGC GCGCGCCAAT CATGGTGACC CGTCGACCGC CCCCGTGTTC GTGATCGGCA TGCCGCGCTC CGGCACCAGC CTCATCGAGC AGATCCTCTC GAGTCACCCG GCGGCTTACG GGGCCGGTGA GGTGAACTAC TTCGCGGCCG CGACCGGGCT GTTCACCGAT CGTGCACGCA GCGACTATCC GGACATGCTG GCCAAGCTCG CAGATGCCGA TCTTGGCTCG ATCGCCGAGG CCTATCTCGC GCGCTTTACC GATCTGCCCG CAGGTGTGAC CCGCATCGTC GACAAAATGC CCTCGAACTT CCTGTTTGCG GGCCTGATCT ATCTTGCGCT GCCCAACGCG CGCATCATCC ACGTCCGACG CAATCCGATC GACACCTGCC TGTCCTGCTT TTCCCAGCTG TTTTCGGAGC CGCAGCCCTT TTCCTACGAT CTGGCCGAGC TCGGCCGCTA CTACCGTGCC TATGAGGCGC TGATGGAGCA CTGGCGCGCG ATCCTGCCGG ACGGCGTGAT GCTCGATGTG GCCTACGAGG ACGTGGTGCG CGATTTCGAG CCCCACGCGC GCAAGATCGT CGCCCATGCC GGCCTCGACT GGGATGAACG CTGCCGTTCG TTCCACGAGA CCAAGCGGCC GGTCAACACT GCGAGCCTGG TCCAGGTGCG CAAGCCGCTG TTCACCGGAT CGGTCGGCCG CTGGCGGCTG TATGGCGACC GGCTCAAGCC GTTGCTCGAC GCGCTGGGTC CAGCCGAGGT GCAGGCGCCC GTCGCCGACG CCATCTCCGG CTCGATGCAG CCGTCGCCGA CGAACCTCGC GCCGCCGGTG GCCACACCGA TCGGCGACAC CCCGCTTTTC GATGCCGACC AACTCGGAGC TTTGCAGACG CTGGCGGACG GCGCGGTGGC GGTGGCCAGG AAGCTCCAGG GGCGCGGCGA CAACAATGAT GCGGAAGCGA TCTTCCGGCT GATCCTCGCC GGCCAGCCAC GCCAGTTCGA CGCGCTGGTC GGGCTCGGCA TGATCTGCAG CGGCAGCAGC CGGCTCGACG AGGCGAAGGA TTGCTTCCAG CGGGCGGTCG CCGTCAACGC GAAATCCGCC GAGGCCCATG GCAGCATCGG CGCGGTCGAG GCTTCGGCCG GACGTTACGA TGCGGCCGTC GGCCATTACG AGACTGCGCT CTCTCTGTCC CCGAACCATC CCGGCATTCT CTACGCCTTC GCGATGGTGC GCCAGAACCA GGGGATGAGC GAGGAGGCGA TGGTGCTGCT GCGGCGCGCC ATCGAGAACA AGCCGCAGCA TCTCGACGCC CATTTTGCCC TGGGCAACCT GCTCTACACG GCCGGCAAGG ATATTGAAGC GGCGAAGTGC TACCTCAAGG TTCTCGAGTT CAGCCCGGAG CACGCCGAAA CGCACAACAA CATTGCCAAT GTGCTATTGC GCCAGGGCCA TCGCGAGCGC GCCATCGAGC ATTACAAGCG GGCGATCGCG AGCCGCCCCG ACTATGGCGA TGCCTATGGC AATCTCGGCA ATGCCTATCT CGAGCTCAAT CGCCTCGAAG AAGCGATCGA GCAGAACCTG CTCGCGCTCA AGCTGAAGCC GGAGCGGTTC GGCTCATACA ACAATCTGGG CGTGGCCTAT CAGGCGCTCG GCCGCTTCGA GGAGGCGACC GCAGCGTTCG AGAAGGCGCT GGAGCTGGCG CCGGACGATG CGCCGATCCA CCTCAACCTC GCCAACATGG CGAAGTTCAA GCCGGACGAT CGGCGGCTGC CTGGACTGCG GGCGTTGGTC GAACGTGCTG ATCAGCTCGA TCAGGAAAAG CAGATCGCCG CTCATTTCGC GCTCGGCAAA GCGTTGTCCG ATCTCAAGGA CTACGATGCT GCGTTCGCGC ATCTACGCCA GGCGAACACG CTCAAGCGCC AGAGCTTCGA CTACGATTCG GAGCAGCGTC TTGCGATGAT GAAGAATGTG GCCAGCCGCT TCACGCCGGA ATTCTTCCGT TCGGTGGTCG GACATGGCGA CGAGTCCTGG GCGCCGATCT TCATCGTCGG CATGCCGCGC TCCGGCACGA CCCTCATGGA ACAGGTGCTA TCGAGCCATT CGAAGGTATT CGGGGCCGGC GAGCTCGAGA CGTTCAAGGA GCTCGTGGGC GAATGCGCCA ACCGTCAGAG GGTGCCCCCT GCCTTTCCCG ACCTGATCGC GCTGCTGCCG CCGGAGGAGA TGACCGCGCT CGGCCGGGAA TATACGGCCC GCGTGCGCGT CCTGGCCCCC GAGGCCGAGC GTATTGTCGA CAAGATGCCG CTCAACTTCC TGTTTGTCGG ACTGATCCAT GCCGCCTTCC CTCGCGCCAG AATCATCAAC ACCCGGCGTG ATCCGCTGGA CAATTGCGTC TCCTGCTACT CGCTGCTGTT CACAGGAGCG CAGCCTTTCG CCTATGACCT GACCGAGCTC GGCCATTACT ACAGGGGCTA TGAGCGCGTG ATGGAGCACT GGCACGCCGT GCTGCCGCCC GGTGTCCTGA TGGACGTCCA ATATGAGGAT CTGGTCGATG ATCTCGAGGG CGTCTCGCGC CGCGTGCTCG CCCATTGCGA CCTTGACTGG GAAGAAGCGT GCCTCGACTT CCACCGGACC GAGCGCATGG TGCGGACGGC CAGCCTGATG CAGGTGCGCG AACCACTCTA CCGCCGCTCG ATCGGCAGCT GGCGCCGTTA TGAGAAGCAT CTCGGGCCGC TCTACGAGGC GCTCGGCATC GCGTCTCCGC CGCCAGCCTA G
|
Protein sequence | MNEPAFALRP RSINASTGPV APSATAIDER ALRITEQAYR KVLALQPHHF RTLCGLAMVR LQLGDVHEAR TLLDQAAREA GDSAEFHLML GKAFAGLGDL ATSSVHFQRA VALDDTLIQA RILLGSAFTN LGDPAGAVRH LELALAADAD DADAHQTLGF ALQRLGQFER AMSHHEAALA ARPQFAAAAA SLGDACRQLG RHAEAIAHYE RALTLQPNAP AVLLNIGGCQ QAIGQTEAAV RTYQRALVLS PHLAEAHYNL GNLHLEMNSW PIAVFHYERA IAERPDFPEA HNNLANALQS RGRHEEALAH YDEALRRRPS YAIAHRNRAD TLRNMKRFDE AIAGYHDALA LEPADTTTLN HLAGVLMIVG RLDEAEQAYR SALAINPRNI GVHLNYGVVK PFTVDDSRWP ALQDLAASVE TLSDDARITL HFTLGRAYAD VKDGEKSLRH LQAGNALERR RISYDENQTL RQMERIRDVF SRDMLQARAN HGDPSTAPVF VIGMPRSGTS LIEQILSSHP AAYGAGEVNY FAAATGLFTD RARSDYPDML AKLADADLGS IAEAYLARFT DLPAGVTRIV DKMPSNFLFA GLIYLALPNA RIIHVRRNPI DTCLSCFSQL FSEPQPFSYD LAELGRYYRA YEALMEHWRA ILPDGVMLDV AYEDVVRDFE PHARKIVAHA GLDWDERCRS FHETKRPVNT ASLVQVRKPL FTGSVGRWRL YGDRLKPLLD ALGPAEVQAP VADAISGSMQ PSPTNLAPPV ATPIGDTPLF DADQLGALQT LADGAVAVAR KLQGRGDNND AEAIFRLILA GQPRQFDALV GLGMICSGSS RLDEAKDCFQ RAVAVNAKSA EAHGSIGAVE ASAGRYDAAV GHYETALSLS PNHPGILYAF AMVRQNQGMS EEAMVLLRRA IENKPQHLDA HFALGNLLYT AGKDIEAAKC YLKVLEFSPE HAETHNNIAN VLLRQGHRER AIEHYKRAIA SRPDYGDAYG NLGNAYLELN RLEEAIEQNL LALKLKPERF GSYNNLGVAY QALGRFEEAT AAFEKALELA PDDAPIHLNL ANMAKFKPDD RRLPGLRALV ERADQLDQEK QIAAHFALGK ALSDLKDYDA AFAHLRQANT LKRQSFDYDS EQRLAMMKNV ASRFTPEFFR SVVGHGDESW APIFIVGMPR SGTTLMEQVL SSHSKVFGAG ELETFKELVG ECANRQRVPP AFPDLIALLP PEEMTALGRE YTARVRVLAP EAERIVDKMP LNFLFVGLIH AAFPRARIIN TRRDPLDNCV SCYSLLFTGA QPFAYDLTEL GHYYRGYERV MEHWHAVLPP GVLMDVQYED LVDDLEGVSR RVLAHCDLDW EEACLDFHRT ERMVRTASLM QVREPLYRRS IGSWRRYEKH LGPLYEALGI ASPPPA
|
| |