Gene Information |
Locus tag | BTH_II0652 |
Symbol | |
ID | 3844546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 762115 |
End bp | 763644 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637837957 |
Product | cytochrome c family protein |
Protein accession | YP_438851 |
Protein GI | 83717174 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
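The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon that
    counts toward the gene length but not toward the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
gene_len, protein_len = cds_lengths(762115, 763644)
print(gene_len, protein_len)  # 1530 509
```

Under these assumptions the result agrees with the Gene Length (1530 bp) and Protein Length (509 aa) rows.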
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.422125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCAT GGGGTAACAC GCTTGCTCGA TGCGCGATCG CCGGGCTTTT CGCTGCAGCC GCGTTCGCGC AGATTCCGCC TGCGAGGCCC GCATCCGTTT CGTCCACTCA CGCCGACGCG CCACGGATCA TCGACGATCT GCCGGCACGC TACTCCGCCG ACCTTGACGC TGCGACGCGC GCGCAGGTGG AGCGCGGACG TTACATCGCG CGACTGGGAG ACTGCGTCGC GTGCCATACC GGCGACAAGT CGAAGCCGCT GGCGGGCGGC CTGGCGTTGC AGACGCCGTT CGGCGAGCTG CACTCGACCA ACATCACGCC CGAGCCGCAG ACCGGCATCG GACGCTATAC GTTCGAGCAA TTCGATCGCG CGATGCGCAG CGGCGTCGCT GCCGACGGTC ATCATCTCTA TCCGGCGATG CCCTATCCGT CGTATGCGAA GGTGTCGCCG GAGGATATGC GGGCGTTGTA CGCGTACTTG ATGAAGGGCG TCGCGCCGGT TCGGCAGGCC AACCGCGCGC TTGCGTTGCG CTTTCCGTTC AATCAGCGCT GGGGCCTCGC GCTGTGGAAC TGGGCCTTCC TCGACGATAC GCCGTTCCAG CCCGACGCGA CCCGCAGCGC CGAATGGAAC CGGGGCGCCT ATATCGTGCA GGGGCTCGGC CATTGCGGCG CCTGCCATAC GCCGCGCGGA TTCGGCTTCC AGGAGAAGGC GATGTCGGGG GCGGGCTCCG CCGGCCCGTA CTTCCTCGCC GGCGAGACCG TCGAAGGCTG GCGCGCGCTG AGCCTGCGCG CGTTGTGGAC GCCGCAGGAC ACGGCCGAGA TGCTCAAGAC CGGACGCAAC CGCTACGGCA CGGTGTCGGG CAACATGGTG GACGTGGTCC AGCACAGCAC GCAGTACATG AGCGACGGCG ATCTGCTGGC GGTGGGCGCG TACCTCGAGT CGCTGCCCGC TGCCGGGCAC GACAAGCCGA TGCTGGTGCC GCAAGGCCCG GCGCGGGCGA TCGCGCCCGC GGGCTCGCCG GCCGCCGGTT CGGCGGCACG GGTGCCGGCC GATCTCTACA CGTCGCGGGG CGGCTTGGGC TATCTGCAGT TCTGCAACGA CTGCCACCGT TCGGACGGAG CAGGCGTGCG GGATGTCTTT CCGCCGTTGG CGGGCAACGC CGCGCTGCTG TCGAAAGACC CGTCCACGCT GGTTCACATC ATGCTGACCG GCTGGCGCTC CGCGCAGACG GCACGTCACG CGCGGGCGCT GACGATGCCT GCATTCGCGC AACTGAGCGA TCAGGAAATC GCGGAGATCC TGAATTTCGC ACGCAAGAGC TGGGGGCGCG CGGACGCCCG CGCGATCAGC GCGGGCACGG TCGGCCGCAT GCGCAAGCAG TTGCGCGCGG GCACCGGAAA CGACACGCGA TTCCAGACGC CGCGCCTCGC CGACATGCTC GCCGAATCCA ACGCGGGCCA ACTCACGCTG GGGGCGCGCC TGAACATCGA CACGCGCCGG ATGCTGCCGA AGCACGTCGG CAATGACTGA
|
Protein sequence | MKSWGNTLAR CAIAGLFAAA AFAQIPPARP ASVSSTHADA PRIIDDLPAR YSADLDAATR AQVERGRYIA RLGDCVACHT GDKSKPLAGG LALQTPFGEL HSTNITPEPQ TGIGRYTFEQ FDRAMRSGVA ADGHHLYPAM PYPSYAKVSP EDMRALYAYL MKGVAPVRQA NRALALRFPF NQRWGLALWN WAFLDDTPFQ PDATRSAEWN RGAYIVQGLG HCGACHTPRG FGFQEKAMSG AGSAGPYFLA GETVEGWRAL SLRALWTPQD TAEMLKTGRN RYGTVSGNMV DVVQHSTQYM SDGDLLAVGA YLESLPAAGH DKPMLVPQGP ARAIAPAGSP AAGSAARVPA DLYTSRGGLG YLQFCNDCHR SDGAGVRDVF PPLAGNAALL SKDPSTLVHI MLTGWRSAQT ARHARALTMP AFAQLSDQEI AEILNFARKS WGRADARAIS AGTVGRMRKQ LRAGTGNDTR FQTPRLADML AESNAGQLTL GARLNIDTRR MLPKHVGND
|
| |
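The GC content and protein fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The demonstration input is just the first two 10-base blocks of the gene sequence.

```python
from itertools import product

# NCBI-style codon table: bases ordered T, C, A, G with the third position
# varying fastest. Translation table 11 uses these same codon assignments
# (it differs from table 1 only in the permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces that split the record's sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above, as a short demonstration.
sample = "ATGAAGTCAT GGGGTAACAC"
print(translate(sample))    # MKSWGN
print(gc_fraction(sample))  # 0.45
```

Passing the full Gene sequence value to `translate` and `gc_fraction` gives results that can be compared with the Protein sequence row and the GC content row.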