Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0862 |
Symbol | |
ID | 3846516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1006768 |
End bp | 1009386 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637838165 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_439059 |
Protein GI | 83717943 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0163712 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCG TCAAACCCGA GTCGCTCGCC CTGCTGTGCC GGACGCTGCG CTTCGAAGGA ATCGACCGGC TGTCGATCGG CGCGCTCGCC TGCTTTGCGC TGCGTGCCGA CGCGCCCGCC GGCCCCGGCG ATCTCGCGCC CGAAGCCTCG CTCTGGCAGG TCGCGCGGCA GTGGCTCGGC GAACACGCGC CGCTCGACGA CGGCTTGCCG AAGCCGTCGG GCGAATTTCT CGTCTACGGC GATGCATGCG CGCCGCCGGG CCGTGACCGC GCCGCGCGCG CGCCGTTCGC GGTGCGCGCG CGCATCGGCG CGGCATGCAA GGAACGGCTC GTCGACGCGC GCGACGCCGC CGGCCGCGCG CTCGCCGAAT TTCGCGCGCT GCCGCCGTCG CATCCCGAGC GCTCGCGCGA TCTCGGGCCG TTCGACGAAC GCTGGCTGGC CGCGCGCTGG CCTCACCTGC CCGCCGGCAC GCGCGCCGAG CATTTCCACA CCGCGCCGCG CGACCAGCGT ATCGCCGGAT TCTGGCGCGG CGACGAAGAC ATCGAACTCG TCAACCTGCA TGCGGACCGG CCGGCCATCG CCGGCGCGCT GCCGCGCGTG CGGGCGCGCT GCTTCGTCGA GCGATGGGTG GGCGGCGTCG CGCGCATCGA CGCGTGCCCG ATGCGCGCGG AAACCGTCTG GCTGTTCCCC GGCGCGGCGT GCGGCATCGT CCTGTATCGC GCGCTCGTCG CGATCGACGA TGAAGACGGC GACGACGTCG TGCGCGTGAT CGCCGGCTGG GAACACGCCG ACGCGCCGCC ATTGCCCGAC GAAGCGTATA TCGGCCGGCC CGCGCCCGAA GACGAAGGAT CTCGCCCAGC GCTCGCCCCC GCCGCGGCGC CCGCAGCGAT TGCCGACGAC GATGCGCGCG CCGATGCCGG CGACGCCGCC GACCGCGCGC CGGGCGCGCC GGCATCGGCC GCGCACGCGC ATTCGCCGGC GGCCCCGGAA TCGTCCGCGG AACCGCCCGC GCCGGATCTG TCCGCGCTCG AACGGGATGC GGCCGCGCTT GCCGCGCAAA CCGACGCGCT GCTCGCCGCG GCCGGCCTGA CCGAAGCCGA CGTCGCGCGC CTGCTGCCGC CGCGCGATGC GCCCGCCGAC ATGACCCTGG ACGAACTCAC CGCGCTCGCG GCGGAACTCG ACGCACGAAC CGCGCAATGG CAAGCGCAGT ACGACGCGGC CGCAGCCGAA CGAGACGAGG CTTCATCGCC CGCTTCGCCG AATTCGGCGG CAGCGCACGA CGCATCGCTC GCCGACCTGC TTCGGCAAGC CGATGCGCAG ATCCGCGCGC TCGTCGATCA GCACGGCTTG TCGCGCGCGC AGATGGAGGC GGCCGCGCGG GACCGGCCTG AGCTTGCCGC GCTCGCCGAC GCGCTCGATG CGCTCGATGC GCCGCTCGAC ATCGATGCGC TGACGGCGGG CCTTGCCGCG CCCGCGGGCG ACGAAGCGAT CGTCGAACCG GATGCGCCGG CCGGGCCGGA CCGGCCAGCG GGCGCGGACC GGCCCGCCGA CGGCGCGCCC GCGTCGATGC ACGCCGCCGC GCCGTCGGCC GGCGACGCGC CGCCCGCGGA GCCGCTCACG CGCGAGCAGG TGATCGAGCG CCACGCGCGC GGGCTCGGCT TCGCCGGCCT CGACCTGAGC GGCCTCGATC TGTCGTCGGC CGCGCTCGAA CGCGCGGACT TGCGCGACGC CCGCATCGAA CGCACCTGCT TCGCCGGATG CCGGCTCCGC GGCGCGTCGT TCGAGCGCGC GCTGCTGTCG CGCGCCGATT TCTCGAACGC GGACCTGCGC GAGGCGACCT TCGTCGACGC GTCCGCGCCC GGCGCATCGT TTCGCGGCGC CGCGCTCGAT CGCGCGCGCC TGGCGCACGC CGACTTCACC GGCGCAGACT TCACGCGCGC GTCGCTCGCC GACGGCCATT GCGCGCACGC GCGATTCGAC GAGAGCGCGA TGACGCAGCT CGCCGCCGCG CGGCTCGACG GCGCGCATGC GAGCTTCGCG GGCTGCGCGC TCGATGCCGC CGACTTCACG TCGGCGCGCA TGCCGCGCGC GAACTTCCAG CACGCGACGC TCACGGCCGC CACGTTCGCG TTCGCGCAGT GCGACGGCGC CGAATGGTAC GGCGCGCAAG CGTCCGGCGC GCAGCTTCGC TCGGCGTCGC TGCGCGGCTC GCGCGCGGAC GCGTCGACAT CGTTCCGGCA AGCCGTCCTG AGCGGCGCGG CGCTCGACGA CGCGAACTGG GACGGCGTCG ACCTGCGCTA CGCGAATCTG CACAAGGCGA CGCTCGATCG CGCGAGCCTC GCGCGCGCGA TCGCGAGCGG CGCGCAACTG ACGCTTTCTC TCGCGCGGCG CGCGGATCTG ACGAAGGCCG ACCTCACGCA CGCGGACGCG CGCTTTTCGA ACCTGCAAGG CGCGTCGCTG CGCCGCGCGC GGCTCGACGG CACGCAACTG CAATCGAGCA ACCTGTACGG CGCCGACTGC TACGGCACCG CGCTCGGCCG ATCGCAGCTC GCCGGCGCGA ACGTCGAGCG AACGCTCTTC GTCGTGCCGG GCCGCCCCGA ACTCGCGTCA TCCCGCTGA
|
Protein sequence | MKIVKPESLA LLCRTLRFEG IDRLSIGALA CFALRADAPA GPGDLAPEAS LWQVARQWLG EHAPLDDGLP KPSGEFLVYG DACAPPGRDR AARAPFAVRA RIGAACKERL VDARDAAGRA LAEFRALPPS HPERSRDLGP FDERWLAARW PHLPAGTRAE HFHTAPRDQR IAGFWRGDED IELVNLHADR PAIAGALPRV RARCFVERWV GGVARIDACP MRAETVWLFP GAACGIVLYR ALVAIDDEDG DDVVRVIAGW EHADAPPLPD EAYIGRPAPE DEGSRPALAP AAAPAAIADD DARADAGDAA DRAPGAPASA AHAHSPAAPE SSAEPPAPDL SALERDAAAL AAQTDALLAA AGLTEADVAR LLPPRDAPAD MTLDELTALA AELDARTAQW QAQYDAAAAE RDEASSPASP NSAAAHDASL ADLLRQADAQ IRALVDQHGL SRAQMEAAAR DRPELAALAD ALDALDAPLD IDALTAGLAA PAGDEAIVEP DAPAGPDRPA GADRPADGAP ASMHAAAPSA GDAPPAEPLT REQVIERHAR GLGFAGLDLS GLDLSSAALE RADLRDARIE RTCFAGCRLR GASFERALLS RADFSNADLR EATFVDASAP GASFRGAALD RARLAHADFT GADFTRASLA DGHCAHARFD ESAMTQLAAA RLDGAHASFA GCALDAADFT SARMPRANFQ HATLTAATFA FAQCDGAEWY GAQASGAQLR SASLRGSRAD ASTSFRQAVL SGAALDDANW DGVDLRYANL HKATLDRASL ARAIASGAQL TLSLARRADL TKADLTHADA RFSNLQGASL RRARLDGTQL QSSNLYGADC YGTALGRSQL AGANVERTLF VVPGRPELAS SR
|
| |