Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_3821 |
Symbol | |
ID | 5156274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4003103 |
End bp | 4004323 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640558659 |
Product | hypothetical protein |
Protein accession | YP_001239803 |
Protein GI | 148255218 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.133528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCCGC CGCCGCGTCG GTCCGATCGG GCCCGCAATC CGTTCGTCGT CGTCGGCAAT GCGATCATCA CCCTGCTGCT TCTGGCTATG ATCGGAGCGG GGGCTGCGTA TTACTACGGA CGGCAGATCC TGGAGGCTCC GGGACCGTTG CAGGACGACA AGATCGTCAA CATTCCTCAG CGCGCGGGCA AGCGCGACAT CGCGGATGTG CTGGCCCGCG AGGGCGTGAT CGACGTCAAT CCCTGGATCT TCATCGGTGG CGTGTATGCG TTGAAGGCGA GTTCCGACCT CAAGCCCGGC GAGTATGCGT TCCAGAAGAA TGCGTCGCTG CGCGACGTGA TCGGCGTGAT CGTCGAGGGC AAGGTCGTGC AGCATGGGGT CACGATTCCC GAAGGTCTGA CCTCCGAGCA GATCGTGGCC CGCCTCTCTG ACAACGAGAT ATTCACCGGC AGTGTCCGTG AAATTCCGCG CGAGGGAACG CTCCTTCCTG AGACCTACAA ATTCCCGCGT GGCACGAGTC GCGAGCAGGT CATTCAGCGC ATGCAGCAAG CGCAGAAGCG TGTGCTGGCC GAGATCTGGG AGCGCCGCAG CCCTGATTTG CCCGTGCGGA CGCCAGAACA GCTGGTGACG CTCGCTTCGA TCGTCGAGAA GGAAACCGGC AAGCCCGACG AACGCAGCCG GGTCGCCGCC GTTTTCGTCA ATCGCTTGAA GCAGAAGATC AAGCTGCAGT CGGATCCCAC GATCATCTAC GGCCTCGTCG GAGGCAAGGG GACGCTGGGA CGGCCGATCA AGCGCAGCGA AATCACACAG CCTTCGCCGT ACAACACCTA TGTGATCGAG GGGCTGCCCC CGGGGCCGAT CTCAAACCCG GGCCGGGCGT CCCTGGAAGC CACGGCCAAC CCGGCGCGGA CACGTGACCT CTATTTCGTG GCCGACGGCA CCGGTGGTCA CGCCTTCACC GAGACCTACG ATCTGCATCA GAAGAACGTC GCCAAGCTGC GGGCGATGGA GCGGCAGACC CAGAACGACA CGGTCGAGCC TGAGGAGGCG CCGGCGGCCA CCGCAGCGGG CGCCGTTGAT CCGGCCGCCG CGGCAGCAGC ACCGCGTGCG CCGACGGGGG CCAAGAAGCC GCCGGCCAAT CGCGCATCGG GCGCTCCGGC GCCTGCTGCG GGCGGCAGGC AGGGCGCGGC GCAATCGAGC CCGCCGGTCG TTCAGCAATA A
|
Protein sequence | MPPPPRRSDR ARNPFVVVGN AIITLLLLAM IGAGAAYYYG RQILEAPGPL QDDKIVNIPQ RAGKRDIADV LAREGVIDVN PWIFIGGVYA LKASSDLKPG EYAFQKNASL RDVIGVIVEG KVVQHGVTIP EGLTSEQIVA RLSDNEIFTG SVREIPREGT LLPETYKFPR GTSREQVIQR MQQAQKRVLA EIWERRSPDL PVRTPEQLVT LASIVEKETG KPDERSRVAA VFVNRLKQKI KLQSDPTIIY GLVGGKGTLG RPIKRSEITQ PSPYNTYVIE GLPPGPISNP GRASLEATAN PARTRDLYFV ADGTGGHAFT ETYDLHQKNV AKLRAMERQT QNDTVEPEEA PAATAAGAVD PAAAAAAPRA PTGAKKPPAN RASGAPAPAA GGRQGAAQSS PPVVQQ
|
| |