Gene BURPS668_A1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1700 
Symbol 
ID4887223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1651544 
End bp1652983 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content62% 
IMG OID640131638 
ProductStAR 
Protein accessionYP_001062695 
Protein GI126443205 
COG category[S] Function unknown 
COG ID[COG4529] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.91604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGTTT CTATCGTGAT CGTGGGAGGT GGATTCAGCG GTGCGTTGAC TGCCATTCGG 
CTTGCGTGCG CGGCCTTGCC GGTCGGGACT TCGCTCACGT TGGTGGATGA ATACGGGCAA
TTCGGACGTG GCGTTGCCTA CCGCGCCGAT GCCGAGGGAT GGCTACTGAA TGCGCCGGCG
CGAGTGCTCG CCGTGCCTCC CGACAGGCCG CACGATTTCG TCGACTATTG CAATCGGCAT
GGCCATCCGG CGAGCGCCGA CAGCTTCATG CCTCGAGCGC TGTACGGTGA CTATCTGGCT
GACCAACTGC GACGAGTCGA GAACGTGGCC GTTTCCCATC GATTGCGCAA GATCGAGGAA
CGGGTCGTCG CGATCGCGTC GGATCGCTCT GGCGGCATGC ACGTTCGTCT ACGCGGCGGT
GAGGTGTTGC AGGCTACCGC CATTGTACTG GCGACCGGCC CCGGCCGTCG CAACGGGGGC
GACGCCACCG GGATCGACGT GTCGGCACTT GGCCCGCACT ATTTTTCGGA CCCGTGGGAA
GTAGCGGTTC TTCGCGACAT GCCGGCGGAA GGTCACTTTT TTATTCTGGG TTCGGGGCTG
ACCTCGGTCG ACGTAATCAG CGCCTTGCAA CGTCGCAGTC CGCGTAGCCG ATTCACGGCG
ATGTCGCGCC GCGGCTTGAT TCCTCAATCC CATCAACCTT GCGTCCTGCC ATTGACATCG
GCAGTCAAGT CCGAGCTTTC GTCGGGCCTG TTGGTTCCGC CACGACACGC TCTGGCGGTG
CTTCGCAAGA CCGTGCGCGA ACATACCGGT TCGGGGGGGG ACTGGCGCGA AGTGATCGAC
AGCATTCGCC CCGCCATCCC GCAGATATGG TCGCACTGGT CCAACGCTGA GCGGCGTGCG
TTTGTCCGGC ACCTCGCCGC CTATTGGGAT ACGCATCGGC ATCGTTGTGT GCCCGAAACG
ATGTCCATTT TAACGAGGTT GAAGAAGGAA AATCGCCTCA CGATGCTGGC GGGGCGACTC
GAAGCCGCGC GCCTCGAAGC AGAGGGGCTT TCCTTGACGG TCCGATTGAG AGCCACAGAT
GCATCCCGAG CGGTGCATGC GTCCTATCTC GTCGACTGCA CGGGGCCACC ATCGCGCGGC
GTATATCCCT CTGATCCGCT CTACCGGCAA TTGCAGCGCG ATGGTCTCGC CGAATTCGAC
GACAACGGCC TCTGTGTTGA CGACGAGTAC CGTATCGCGA CCAACGCTTA TTGCAGGAAT
CAGGCTCTCT TTTATATCGG CCCGCACCTG AAGCGACGCT ACTGGGAAGC GACGGCGGTC
CCCGAACTGA TGGGGCATGT GGCACGGCTT GTGGCCGTGC TCGAGCGCAC GCTTGTTGCC
GCTGCTTCGG CGCAGGCAAG GGCGCTAGAC CAAGATACGT GTCACGTCGA AAGGTGGTGA
 
Protein sequence
MPVSIVIVGG GFSGALTAIR LACAALPVGT SLTLVDEYGQ FGRGVAYRAD AEGWLLNAPA 
RVLAVPPDRP HDFVDYCNRH GHPASADSFM PRALYGDYLA DQLRRVENVA VSHRLRKIEE
RVVAIASDRS GGMHVRLRGG EVLQATAIVL ATGPGRRNGG DATGIDVSAL GPHYFSDPWE
VAVLRDMPAE GHFFILGSGL TSVDVISALQ RRSPRSRFTA MSRRGLIPQS HQPCVLPLTS
AVKSELSSGL LVPPRHALAV LRKTVREHTG SGGDWREVID SIRPAIPQIW SHWSNAERRA
FVRHLAAYWD THRHRCVPET MSILTRLKKE NRLTMLAGRL EAARLEAEGL SLTVRLRATD
ASRAVHASYL VDCTGPPSRG VYPSDPLYRQ LQRDGLAEFD DNGLCVDDEY RIATNAYCRN
QALFYIGPHL KRRYWEATAV PELMGHVARL VAVLERTLVA AASAQARALD QDTCHVERW