Gene BURPS1106A_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1072 
Symbol 
ID4899586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1052015 
End bp1053052 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID640134302 
Productputative lipoprotein 
Protein accessionYP_001065352 
Protein GI126452262 
COG category[S] Function unknown 
COG ID[COG5430] Uncharacterized secreted protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGTTA TACTGAAGCC GGCTCGGATG CTTGAGATAA GTCAAAAAAT GACGATAACG 
AATATCAGGC AGCGCGATTC GAAGCAATGG CGGCTCGCGC TGCTCGTCGC GTGGCTGCTC
GCGTGCCCGT GGGGGGCGGC GCACGCCGAG ACGTGCTCGG TCACGACGCC CGCGCCGAAT
TTCGGCTCGG TCGATCCGAT CACGCTCGCC GCCGTGTCGA CGACCGCGAC GATGACGGTT
ACCTGCACGT GGTCGGCCGT CACGCTCACG CCGAACGTGC TCGTCTGCCT GAACCTCGGC
GGCACCAGCC CGCGCTATCT GACGAACGGC TCGAACCAGA TGCAGTACGA CCTGTACCAG
GATTCGGGGC ACACGGTGAG CTGGGGCTCG TCGTACTACG GCACGACGCC GATTTCGCTC
ACGCTCGTGA AGCCCGCGCT CAGCACGAGC GCGAGTTCGA CCGTCACGAT CTACGGCCAG
ATCGCCGCGA ACCAGCCGAC CGTGCCGACG GTCGGCAACG CGAGCACCAC CTATTCGCAG
ACGTTCGGCG GCAACACGAC ATCGCTGAAC TACAACTTCT ACACGCTCGC GCCGCTGCCG
TGCGCGTCGC AATCGTCGTT CGGCACGTTC GCGTTCACCG CGAGCGCGAC CGTCGTCAAC
GATTGCTTCA TCAACGCCAC CAACGTCGCG TTCGGCTCGA CGGGCGTGAT CCAAGGCGCG
CTGACGGCGA CGGGCACGAT CAGCGCGCAG TGCACGAACG GCGACGCGTT CCGGATCGCG
CTGAACGGCG GCGCGAGCGG CAACGTCGCC GCGCGCGCGA TGCAGCGCAC GGGCGGCGGC
GGGGCCGTCA ACTATCAGCT GTATCTCGAC GCCGCGCATT CGACGATCTG GGGCGACGGC
ACGGCCGGCA CGTCGACGGC GACGGGCACG GGCAGCGGGC TGTCGCAGTC GCTCACCGTG
TACGGCCAGG TGCCCGCGCA GACCACGCCC GCGCCCGGCA CCTACAGCGA CACGATCACC
GCAACGATCA CGTTCTGA
 
Protein sequence
MMVILKPARM LEISQKMTIT NIRQRDSKQW RLALLVAWLL ACPWGAAHAE TCSVTTPAPN 
FGSVDPITLA AVSTTATMTV TCTWSAVTLT PNVLVCLNLG GTSPRYLTNG SNQMQYDLYQ
DSGHTVSWGS SYYGTTPISL TLVKPALSTS ASSTVTIYGQ IAANQPTVPT VGNASTTYSQ
TFGGNTTSLN YNFYTLAPLP CASQSSFGTF AFTASATVVN DCFINATNVA FGSTGVIQGA
LTATGTISAQ CTNGDAFRIA LNGGASGNVA ARAMQRTGGG GAVNYQLYLD AAHSTIWGDG
TAGTSTATGT GSGLSQSLTV YGQVPAQTTP APGTYSDTIT ATITF