Gene Information |
Locus tag | BURPS668_A1700 |
Symbol | |
ID | 4887223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1651544 |
End bp | 1652983 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640131638 |
Product | StAR |
Protein accession | YP_001062695 |
Protein GI | 126443205 |
COG category | [S] Function unknown |
COG ID | [COG4529] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
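The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1651544, 1652983)
print(length, protein_length_aa(length))  # prints: 1440 479
```

Both results agree with the Gene Length (1440 bp) and Protein Length (479 aa) fields.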
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.91604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGTTT CTATCGTGAT CGTGGGAGGT GGATTCAGCG GTGCGTTGAC TGCCATTCGG CTTGCGTGCG CGGCCTTGCC GGTCGGGACT TCGCTCACGT TGGTGGATGA ATACGGGCAA TTCGGACGTG GCGTTGCCTA CCGCGCCGAT GCCGAGGGAT GGCTACTGAA TGCGCCGGCG CGAGTGCTCG CCGTGCCTCC CGACAGGCCG CACGATTTCG TCGACTATTG CAATCGGCAT GGCCATCCGG CGAGCGCCGA CAGCTTCATG CCTCGAGCGC TGTACGGTGA CTATCTGGCT GACCAACTGC GACGAGTCGA GAACGTGGCC GTTTCCCATC GATTGCGCAA GATCGAGGAA CGGGTCGTCG CGATCGCGTC GGATCGCTCT GGCGGCATGC ACGTTCGTCT ACGCGGCGGT GAGGTGTTGC AGGCTACCGC CATTGTACTG GCGACCGGCC CCGGCCGTCG CAACGGGGGC GACGCCACCG GGATCGACGT GTCGGCACTT GGCCCGCACT ATTTTTCGGA CCCGTGGGAA GTAGCGGTTC TTCGCGACAT GCCGGCGGAA GGTCACTTTT TTATTCTGGG TTCGGGGCTG ACCTCGGTCG ACGTAATCAG CGCCTTGCAA CGTCGCAGTC CGCGTAGCCG ATTCACGGCG ATGTCGCGCC GCGGCTTGAT TCCTCAATCC CATCAACCTT GCGTCCTGCC ATTGACATCG GCAGTCAAGT CCGAGCTTTC GTCGGGCCTG TTGGTTCCGC CACGACACGC TCTGGCGGTG CTTCGCAAGA CCGTGCGCGA ACATACCGGT TCGGGGGGGG ACTGGCGCGA AGTGATCGAC AGCATTCGCC CCGCCATCCC GCAGATATGG TCGCACTGGT CCAACGCTGA GCGGCGTGCG TTTGTCCGGC ACCTCGCCGC CTATTGGGAT ACGCATCGGC ATCGTTGTGT GCCCGAAACG ATGTCCATTT TAACGAGGTT GAAGAAGGAA AATCGCCTCA CGATGCTGGC GGGGCGACTC GAAGCCGCGC GCCTCGAAGC AGAGGGGCTT TCCTTGACGG TCCGATTGAG AGCCACAGAT GCATCCCGAG CGGTGCATGC GTCCTATCTC GTCGACTGCA CGGGGCCACC ATCGCGCGGC GTATATCCCT CTGATCCGCT CTACCGGCAA TTGCAGCGCG ATGGTCTCGC CGAATTCGAC GACAACGGCC TCTGTGTTGA CGACGAGTAC CGTATCGCGA CCAACGCTTA TTGCAGGAAT CAGGCTCTCT TTTATATCGG CCCGCACCTG AAGCGACGCT ACTGGGAAGC GACGGCGGTC CCCGAACTGA TGGGGCATGT GGCACGGCTT GTGGCCGTGC TCGAGCGCAC GCTTGTTGCC GCTGCTTCGG CGCAGGCAAG GGCGCTAGAC CAAGATACGT GTCACGTCGA AAGGTGGTGA
|
Protein sequence | MPVSIVIVGG GFSGALTAIR LACAALPVGT SLTLVDEYGQ FGRGVAYRAD AEGWLLNAPA RVLAVPPDRP HDFVDYCNRH GHPASADSFM PRALYGDYLA DQLRRVENVA VSHRLRKIEE RVVAIASDRS GGMHVRLRGG EVLQATAIVL ATGPGRRNGG DATGIDVSAL GPHYFSDPWE VAVLRDMPAE GHFFILGSGL TSVDVISALQ RRSPRSRFTA MSRRGLIPQS HQPCVLPLTS AVKSELSSGL LVPPRHALAV LRKTVREHTG SGGDWREVID SIRPAIPQIW SHWSNAERRA FVRHLAAYWD THRHRCVPET MSILTRLKKE NRLTMLAGRL EAARLEAEGL SLTVRLRATD ASRAVHASYL VDCTGPPSRG VYPSDPLYRQ LQRDGLAEFD DNGLCVDDEY RIATNAYCRN QALFYIGPHL KRRYWEATAV PELMGHVARL VAVLERTLVA AASAQARALD QDTCHVERW
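The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a GC-content calculation and a stop-codon check. The helper names are hypothetical, the short test string is a made-up toy sequence rather than data from this record, and the stop-codon set is the standard one for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input (not from the record): two blocks separated by a space.
toy = clean_sequence("ATGGCC ACTTGA")
print(toy, gc_percent(toy), ends_with_stop(toy))  # prints: ATGGCCACTTGA 50.0 True
```

Applied to the full gene sequence, `gc_percent` could be compared against the GC content field, which reports a rounded value.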
|
| |