Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5394 |
Symbol | thuR |
ID | 7381498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 395436 |
End bp | 396473 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643649006 |
Product | transcriptional regulator LacI family |
Protein accession | YP_002547243 |
Protein GI | 222106452 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00772538 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGCA TACATCAGCT TGCAAAACAT CTCGACATCT CTATCGGGAC CGTGTCGCGC GCGCTGAATG GACGCCCCGA CGTCAACGCG GAAACGCGAC GGCGGGTACT GGAGGCCGCT GAGGAACTGG GCTATGTCGC CAATCAGTCG GGCCGCAGCC TGCGCAAGGG GTCAACCAAC GTCATCGGAC TGATGATCGA GTCCGGCAAG GACAATGCCG ACAACAGCGA CAACTTCTTT TTCGGCGTCA TGGACGGTTT GCAGACGGTG TTTGCCCGGC ACAATCTCGA CCTCGTCCTG CTCCCCTGCC CTGCCGACGA AGACCCCCTG GAATATCTTC AGCGCATGGT GGCCCGCCGC CTGGTCGATG CGATGATCAT TTCGGCTACG CAGAGGGTCG ATAAGCGTAT TGATCTGTTG ATCAAAACCA AAATACCCTT TGTAACCTTG GGGCGCAGCA CGTCGGGCGG CAGCCATACC TGGATCGACC TCGATTTTGC CGGTGTCGCC AATTCCGCCG TGGATCGACT GGTGTCGAAA GGTCACCGTC GCATCGCCAT CGCCGCCCCC TGCACTGATA TCAACCTCGG CACCGTGTTT ACCGACGCCT ATCAGGCCGC GCTGGAGCGC AACGGCCTGG CCTTCGATCC GGCCTTGGTG CTGCGGGCGA AATCCAGCGA AAGCGGCGGC TATAGCGTGG GCAGCGAGCT GCTGGCGCTC GATCCGCGCC CGACCGCCAT CATCCTGATT TACGAACTCA TGGCGATCGG CCTCTACAGG CGGCTGGCTG AGGCCGGCGT CATCCCCGGC CGGGACATGG CCGTCATCGG GTTTCGTGAA GCGCCCCGCG CCCGGTTTCT GCAACCCGCT CTGACCTGCT ACCGCCTTTC CCTGGAGGAT CTCGGGGTGG AACTGGCCGA AACCTTGCTC GCCTCAATGC CGGATTATGC GGAAACATAC AGAACCCATG CCCGCAATCG CCTCTGGCCA CTAGAACTGG TTCCGGGCGA AAGCGATGCC TTCGACCTGC TGACATAG
|
Protein sequence | MKGIHQLAKH LDISIGTVSR ALNGRPDVNA ETRRRVLEAA EELGYVANQS GRSLRKGSTN VIGLMIESGK DNADNSDNFF FGVMDGLQTV FARHNLDLVL LPCPADEDPL EYLQRMVARR LVDAMIISAT QRVDKRIDLL IKTKIPFVTL GRSTSGGSHT WIDLDFAGVA NSAVDRLVSK GHRRIAIAAP CTDINLGTVF TDAYQAALER NGLAFDPALV LRAKSSESGG YSVGSELLAL DPRPTAIILI YELMAIGLYR RLAEAGVIPG RDMAVIGFRE APRARFLQPA LTCYRLSLED LGVELAETLL ASMPDYAETY RTHARNRLWP LELVPGESDA FDLLT
|
| |