Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4962 |
Symbol | |
ID | 5153407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5199300 |
End bp | 5202710 |
Gene Length | 3411 bp |
Protein Length | 1136 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640559752 |
Product | hypothetical protein |
Protein accession | YP_001240881 |
Protein GI | 148256296 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA CTCTGGCCGC CGCCTCGATC CCTCCGGTTT CAGCATCGAT CGGAAACCTA ACGTCTCTGA AACCTTATCC GGGCCTGAGG AGCTTCACGC AGGACGAAGC CTTGCTCTTC TTCGGTCGAG AAGTGCAGGT GCGCCAGATA CGCGACATCC TCGCGAATCG CAATCTACTG GTCGTGCTCG GCGGATCCGG CTCCGGCAAA TCCTCCCTGG TGCGCGCGGG CCTGCTGCCG AAGCTCAACA GCACGGCTCC CATTCCGAAG CGGAGCGGAG CCTGGTATGC GGTCGAATTC CGTCCATTGA CCAATCCCAT TGCAGAGCTT TTCGAGGCCA TATTCGTACA GATCTTTCAG CCGCTGTTGA ATACGCCCTC TTCCGTCGCG GGGACGGGAA CGGAAGGCCA GCCGACGCAG ACCGGTTTGG AAGAAAACAG GCGCCTCGCG GCTGTAAGTG CTGCATTGGA TATCGACCCT GCACTCCAGC CGAGCGAAGA CATCGAGAAA CGATGCAGAG AGCGGCTGCG AGAGCGTCTG TTCAAAGGTC GGACGATCGA CATCGGCTCG CTTTTCGCAT TTGCTGAGCA AACAATCGTA GTTCTCGACG AGAAGCTTGC CGCGGGCCCT CGCTCGGGGA AAGCGAATCT TCTGATCCTG ATCGACCAGT TTGAGGAAGT CTTCTCATTG CCGGCCGAAA ACAGGGAAGA CGGGCAGAGC ATGGTGATGT CGCTCGTTAC CGGCATCCAG GCATTCCGGC CCGACAACCT TTTCCTGATC GTCACGATGC GGAACGAGTG GCTGCACCGA TGCAGCGAGA TTCCCGGAGT GGCCGAAGCG ATGAACGGCT CGACCTATCT GGTCGATCTT CTGAACGATT CCGAAATCAG AAACATCATT GTCGAACCGG CCCGTTCGGT CCTGCGCGCG GCGAGACTCG ATCCGGGGCC GTCCAGTCGC GGACCCTATT CCCTAGACGT ACTTCGCCTC CTCCAGCAAG CCTTCGACGA TACCGCAGCC GTGCCGGATG CGTCGGATCG CCTTCCGCTC CTGCAGCACT TGCTGCAGCT GCTGTGGGAC AGCGCGGATC CGACCAGGCC CCACTTTTCG ATCGAGACCA GACACCTCGA AGCGATACCA GGCTGGGAAG AGAAAGCAGG CTGGAAGTTG AAGGGACTTC CCGGTTGCCT CAACGCCCGC GCCGGACAGG TCTTGAAAGC CGCCGTCGAG GCCGCGTCTC AGTCTTCGCC TCTGCTCGGC AAAGACGGCG CCGAAAAGCT GATTCGGTCT GCGTTCGTCA GCCTTGCAAT CCTCGACGAG AAAGGGATCG TTCGGCGCAA TCTCGTGACC ATCGACGAGA TACTCGACTC AAGCGGCTTG GTCGAACGGG CCACCCGCAG CAACAGGACA GAGACTCGAT ATGTGCTCGC AGGTTGGCGC ATTGACGCGA CCACGGTCGA GAAACCCGAC TCAGCTGAGG AGTCCGAACG GGCCGCTCGC AACCTGGCAG AGATTCGAAA TGCGCTCATA GGGATGCTCG CGCAGTTCAA GGCGGCTTCG CTGGTCGGCT CCAAGCAGAC AGCCAAGGGC GAACTCTTCG ATATCAATCA CGAAGCGCTG GTTCGAAACT GGGATACCTG CGCCAGCTGG ATCAGCCAGG CAAAACTAGT CAAGGATCGG CTGCGCGCCA TCGACGAGAA GATCCGACAG ACAAACGTCG CGCAAAAAGG TTGGCTTGCG CGGTTCACCA ATCTGCTGTT CGCAACTGAT CTCAGCTCGG CCCACGAGCA GGTCGGAAGC GAAACGGCCA AAGCCCTCCG CGACGACGTG TTCGGCGATC AGGCGACTTT TAGCAAATCC TGGGCGCGGC ACGTTCTGGG GAACGACAAT GTGGCTCAAA TTGGCCACCG CGTTGCCGAT GCACAGCGCT TCGTTGACAA TCCGCTGCAT AGATACCGCC CGCTGGCAGT GTTCATCGCT GCGCTTGCTT TGGCCGGACT TTTGACCATC GTCACTAGGC AGCTATTTAT TCGCGATATG GAGCAGTTCG TTAATCTTCA GAGCATCGTC CGACGGACCG ACCTTTCCCA GGGAACCGTT ACGCCGGCCG GACCGCCTGA GACTTACGCA ACATTCAAAA TTGCATTTGG CAAACTGAAC AGAGTACTCA CACCCGATGT CATGCGACGG CCTTTGGCCG ATATGCTTGT CGGCCTCGAG GGAAACTGGA GGTCGCATTT TGGGCGCTCC ATATGGCTCA AATCTTCGGC GGCACAACAT CCGACCTCGA TGAGCCTCCA CGGATCAGTG GAGAGCAAGA AGGCGATTTG CATCACCCCG CGCCCGGAGA GAAAGCTACA AATCCGGAAG GATGAGCGAA CCCAAAAGAA CGAACTCGAA TGGGGCGCGG TCCAAATAGG CTCCAAAACA GATTCGGGGT CTGTCCAAGC CGCTCAGAAG ACGGTCTGGA GACCAACAAC TCCAGCAATC GACAAGGAAC CGACAGTCAA ATCGAATCTA TACGGCGGAG AGGAATGGCC CGATGGCTCG GTGGTCTGCA CGTCTCCCGA CGGAAGGTGG CAGCTGAAAT GGATCAATGA TAAGCCTACT CCGAAATGGC CTTCCATCCG CTATACCCTG GTGAACAAGC TGACACCGCC TCAAGGCGAG CCAGGATTCT ATGTTGAGGT AGGGAGCGAG CGATACTTCA CTGACGAGCA AACGAGCGGT GCATACGGAC GCGAGCTTCA ATCGTCGCTT CCGACCGTGT CAGACAGTGT AAGCAATCCC GGGAACACAA ATACCAAGGC CATTCAATTT GTGCGCGATG GGCATTGGGT CGGCTTCTCG ATCCCGGTGG AGGGCGGCAA GGCCGTTACG CTATGGACCA CCGAAGGCAT CGCTGAACCG CAGGAAACCT ATGCACCTCC AAGCAGCAGC AAGCCTTGCG CACGGAAAAA TGGCGACCTG CTTCGCTGTA CGATTGGGTC TCTCACGTAC GGCGAGAATG TCTATGATGT CCGGGTAACT TCCTTTCCCC CAAGCCAGGA TTCGCCAGAT TGCTTCTCTG CGGGGTCAAC CTGCGCAACG TTCATCGATC TCCTTTTCGT CGATCAATCG ACTGACAAGG TCGGTCAACC GGCTGACAAG AAAGTCCGCG CGGTCGATCC ACAAGACGAC ATCATCAGTG CCAGATTAAG CTTCCCCAAG AGGCAGATCA AAAGCGGAAC AATCACCGCG GAGGGTTATC TGGTTTTGAC GGATATCGCT AACCAGACGT GGCGGTATTT GATCGACGGC AACAAGCTGG CCGAGCTGCA AAAGGATACA TGGCGCGAGA GCAATCTCGA AAACGCTGCG TGGAGCGCTC CCTGCAGGAA GTTGCAATGC GACGACATGA TCCGGCGCTG A
|
Protein sequence | MTDTLAAASI PPVSASIGNL TSLKPYPGLR SFTQDEALLF FGREVQVRQI RDILANRNLL VVLGGSGSGK SSLVRAGLLP KLNSTAPIPK RSGAWYAVEF RPLTNPIAEL FEAIFVQIFQ PLLNTPSSVA GTGTEGQPTQ TGLEENRRLA AVSAALDIDP ALQPSEDIEK RCRERLRERL FKGRTIDIGS LFAFAEQTIV VLDEKLAAGP RSGKANLLIL IDQFEEVFSL PAENREDGQS MVMSLVTGIQ AFRPDNLFLI VTMRNEWLHR CSEIPGVAEA MNGSTYLVDL LNDSEIRNII VEPARSVLRA ARLDPGPSSR GPYSLDVLRL LQQAFDDTAA VPDASDRLPL LQHLLQLLWD SADPTRPHFS IETRHLEAIP GWEEKAGWKL KGLPGCLNAR AGQVLKAAVE AASQSSPLLG KDGAEKLIRS AFVSLAILDE KGIVRRNLVT IDEILDSSGL VERATRSNRT ETRYVLAGWR IDATTVEKPD SAEESERAAR NLAEIRNALI GMLAQFKAAS LVGSKQTAKG ELFDINHEAL VRNWDTCASW ISQAKLVKDR LRAIDEKIRQ TNVAQKGWLA RFTNLLFATD LSSAHEQVGS ETAKALRDDV FGDQATFSKS WARHVLGNDN VAQIGHRVAD AQRFVDNPLH RYRPLAVFIA ALALAGLLTI VTRQLFIRDM EQFVNLQSIV RRTDLSQGTV TPAGPPETYA TFKIAFGKLN RVLTPDVMRR PLADMLVGLE GNWRSHFGRS IWLKSSAAQH PTSMSLHGSV ESKKAICITP RPERKLQIRK DERTQKNELE WGAVQIGSKT DSGSVQAAQK TVWRPTTPAI DKEPTVKSNL YGGEEWPDGS VVCTSPDGRW QLKWINDKPT PKWPSIRYTL VNKLTPPQGE PGFYVEVGSE RYFTDEQTSG AYGRELQSSL PTVSDSVSNP GNTNTKAIQF VRDGHWVGFS IPVEGGKAVT LWTTEGIAEP QETYAPPSSS KPCARKNGDL LRCTIGSLTY GENVYDVRVT SFPPSQDSPD CFSAGSTCAT FIDLLFVDQS TDKVGQPADK KVRAVDPQDD IISARLSFPK RQIKSGTITA EGYLVLTDIA NQTWRYLIDG NKLAELQKDT WRESNLENAA WSAPCRKLQC DDMIRR
|
| |