Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_0172 |
Symbol | |
ID | 6201746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 203510 |
End bp | 205195 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641704167 |
Product | hypothetical protein |
Protein accession | YP_001831318 |
Protein GI | 182677172 |
COG category | [S] Function unknown |
COG ID | [COG3798] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0383324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGC CAACGCCAAG CGAAGCGAAG ACTCTGCCGC TCTATGAAGA AACGATATCG GTGGGGAAGA GACGTGTGCG TACTGGTACG ATCCATGTGC GAACCGTCAC CGACACGATA CGGGATCATG TGGATATCGA GTTGTCGGAA GACAGGATCG ACGTGAAGCG TGTTCCCGTC GATCGCGTCG TCGATCATGT CCCAGAGGTA CGGATCGAAG GCGATCTTAT GATCGTTCCT GTCCTGAAAG AAGTTTTCTT CGTCGAAAAG CGCCTCGTCC TGGCTGAGGA AATTCACCTC CGGCGCTGGA TTGTCAGCAA ACATGTGAAA ATTCCAGTCG AACGGCGCAG CCAGCGTGCC CTTGTGCAGA CCCCGTTCCC GGAAGCCGCG TCCGATACAG GCGGTGAGGA CGATTCAACG ATCAATGGGA GGCCATCCAT GACGGAGATC AATGAGTCCG TACACTATCG TTTAATTACG GCGTTCTACG ACAATGAGGC CGCGGCGCAA GCGGCCGTCG ACAAACTGCT CGCTATCGGT ATTCCCAATG ATGATATTCA TTTCATCCAC GGCCGCGCCA TGCATGCGGC GATCGTGGAA GAGAACAAAG GATTCTGGGA AATCATCAAA TATCTTTTCC TGCCGGAAGA GGACCGGCAT ATTTATGCCG AAGGCTTGAA ACGGGGTGGT TATCTCGTTT CGGTTAGAGC GGATGACGCC ACTTACGGGA AAGCTTTCGA TATCCTTGAT TCCGAAGGGA CCCTTAACCT GACGGAACAG GAAGCGCTCT GGCGAGCCGA AGGATGGACG GGCTATGAGG GGGCCCTTGA GCAGCCGGTC GTCGAACAGC CTGCCCTTGA ACAGCCTGCC CTTGAACAGT CGGTCGTTGA ACAGCCGATC GTGGCGGGTG AGGCTGAGGC AGCAGCAAGT GTCGTTTCCA CCAGCGCGGT CTCAGTGACA CCGCTCGCGC AGACTGTTTC GGAGCCTGCT AGCTTGCAAA CTGGCACTCC TTTGCAAACT GGCGCTCCTT TGATGGAACA ACCTGCCGAA AGTAAAGCCT TCGAAAATGA AGAGATGGTG ATTCCGATTG CCGAAGAACT CTTGCACATT GGCAAGCGTG AGATCGACAA CGGCCGTGTT CGTGTCCACA GCTATACTGT CACAAAGCCG GCAAGCGCTT CGATCAAATT GCGGGAAGAA AATATCACGA TCGAACGGCA TCCCGTCGAT CGGCCGATCA CGGCCACGGA TAATGCGTTC GCGGAAAGAA CAATCGAACT TGATCAGCAT GCTGAAGTCC CCGTCGTGGA GAAGGACGTT CATGTGAGCG AGGAAATTCG CCTCGCCAAG GATGTTCGCG AACATCAAGA AACGGTTTCT GATTCCGTGC GCGAGACCAA AGTGGATATT GAGGACACAC GCCGGAGCAG ACTTGCAGCA AGCACTGGCC TCGTTGGGTC GGACATGGTT TCCTATGCCG ATAAGATTCA CGACCATATG AATGTCATGG CCTCGGACGG TCAGCTTATC GGTGTCGTCG ATCATCTCGA GGGTGATCGG ATCATGCTGA CCAGCAGCGA CTCTCCCGAT CACCTGGAAC ATTTCATTCC TCTCGCCTGG GTGAAGACGA TTGGAGCGGA TGTGGAACTC GACAAACCGG CCAATATGGT GAAGGCCTCC TGGTAA
|
Protein sequence | MNKPTPSEAK TLPLYEETIS VGKRRVRTGT IHVRTVTDTI RDHVDIELSE DRIDVKRVPV DRVVDHVPEV RIEGDLMIVP VLKEVFFVEK RLVLAEEIHL RRWIVSKHVK IPVERRSQRA LVQTPFPEAA SDTGGEDDST INGRPSMTEI NESVHYRLIT AFYDNEAAAQ AAVDKLLAIG IPNDDIHFIH GRAMHAAIVE ENKGFWEIIK YLFLPEEDRH IYAEGLKRGG YLVSVRADDA TYGKAFDILD SEGTLNLTEQ EALWRAEGWT GYEGALEQPV VEQPALEQPA LEQSVVEQPI VAGEAEAAAS VVSTSAVSVT PLAQTVSEPA SLQTGTPLQT GAPLMEQPAE SKAFENEEMV IPIAEELLHI GKREIDNGRV RVHSYTVTKP ASASIKLREE NITIERHPVD RPITATDNAF AERTIELDQH AEVPVVEKDV HVSEEIRLAK DVREHQETVS DSVRETKVDI EDTRRSRLAA STGLVGSDMV SYADKIHDHM NVMASDGQLI GVVDHLEGDR IMLTSSDSPD HLEHFIPLAW VKTIGADVEL DKPANMVKAS W
|
| |