Gene BURPS668_A2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2131 
Symbol 
ID4886516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2070212 
End bp2072854 
Gene Length2643 bp 
Protein Length880 aa 
Translation table11 
GC content76% 
IMG OID640132068 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001063125 
Protein GI126442866 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCG TCAAACCCGA AACGCTCGCG CTGATGTGCC GCACGCTGCG CATCGAACGC 
GCCGACCGGC TGTCGATCGG CGCGCTCGCC TGCTTCGCGC TGCGCGCGCA CGCGCCCGAC
GGCCCCGGCG ATCTCGCGCC GGAGGCCACG CTCTGGCAGA TCGCCCAGCA ATGGCTCGGC
GCCCATGCGC CGCTCGACGA GGGCTGGCCG AAGCCGGCGG GCGAATTTCT CGTCTACGGC
GACGCATGCG CGCCGGCGGG CCGCGAGCAC GCGGGCGGCG CGCCGTTCGC GGTGCGCGCG
CGCATCGGCG CGGCATGCAA GGCGCGGCTC GTCGATGCGC GCGACCCCGC CGAGCGGGTA
CTCGCCGATT TTCGCGCGCT GCCGCCGTCG CATCCGCAGC GCGTGCGCGA TCTCGGGCCG
TTCGACGCGC GCTGGCTGGC CGAGCGGTGG CCTCACCTGC CTTCGGGCAC GCGCGCCGAG
CATTTCCATA CCGCACCGCG CGATCAGCGG ATCGCCGGGT TCTGGCGCGG CGACGAGGAC
ATCGAGCTCG TCAACCTGCA CGCGCAGCAT CCGATCGTCG CCGGCGCGTT GCCGCGCGTG
CGGGCGCGCT GCTTCGTCGA GCGCTCGGCC GGCGGCGCGA CGCGCGTCGA CGCGTGCCCG
ATGCGCGCGG AAACCGTCTG GCTGTTCCCC GGCGCGGCAT GCGGCATCGT CCTGTACCGC
GGGCTCGCCA CGATCGACGA CGAAGACGGC GACGACGTCT TGCGCGTGAT CGCCGGCTGG
GAAGATGCCG CCGCGCCGCC GTTGCCCGCC GACGCTTATC TCGGCCGGCC GGCCTCCGGG
GGCGCCGGCT CGCGCCCGAC GCCCGCGCCC GACGCCGCGC CCGTGGCGTC TGTGGTGCCC
GCCGCGCTCA CCGCCGAAGA AGCGCACGCC GACGAACATG CGCCGGGCGG ATCGGCATCG
GCCTCGCAGG CGCACTCGCC GGCAGCGCCC GAGTTCCCCG AAGCACCGCA CGCGCCGGAT
CTGTCCGCGC TCGAACGGGA AGCGGCGGCG CTTGCCGGGC AAACCGACGC GTTGCTCGCC
GGGCTGGGCA TCACCGAAGC GGACATCGCG CGCTTGCTGC CCGCGCGCGA CGCCCCCGCC
GAGCTGAATC TGGATCAGCT CGCCACGCTC GCGGCCGAAC TCGACGCGCA AACCGCACAG
TGGCAGGCGC AGCAAGCGGC GGCGGCCGTC GAGCGCGGCG ACGCGGCCTC GGCGACGCCC
GCCGCGCCGG CCACGCCGGA TGCCGAGGCC GCGCACGAAG CGTCGCTGGC CGACCTGCTT
CGGCAGGCCG ACGCGCAAAT GCGCGCCCTC GTCGAGCAGC ACGGCCTGTC GCGCGCACGA
ATGGAGGCGG CCGCGCGAAC CCTGCCGGAG CTCGCGCCCC TCGCGGGCTC GCTCGATGCC
CTCGACGCAC TCGATGCGCC GCTCGACGTC GATGCCTTGA CGGCAGGGCT CGCCGCCGCC
GGCGGCGACG CGGCGGCCGA ACCGGATACG CCGGCCGAAC CGAGCCCGCC GGCGCCCGCG
AACGAATTCG CCGCCGCCGC GCCCGCCCCC GCATCGTCCA CCGCCGCGCC GCCGGTGGAC
GATGCGCCGC CAGGGCCGCT CACGCGCGAG CAAGTAATCG AGCGCCACGC GCGCGGGCTC
GGCTTCGCCG GCCTCGACCT GAGCGGCCTG GACCTGTCGT CGGCCGCGCT CGAGCGCGCG
GACTTTCGCC GCGCACGCCT CGAACGCACC CGCTTCGCGG GCTGCCGGCT CGCCGGCGCA
TCGTTCGAGC GCGCGCTGCT GTCGCACGCC GATTTCTCGA ACGCGGACCT GCGCGACGCG
GTCTTCGCCG GCGCCTCCGC GCCCGGCGCA TCGTGGCGCG GCGCCGTGCT CGAGCGCGCG
CGCCTCGAGC ACGGCGACTT CAGCGGCGGC GACTTCGCGC AAGCGTCGCT CGCCGACAGC
CATTGCGCGC ACGCGCAGTT CGACGCGAGC GCGATGACGG CGCTCGTCGC GGCGCGCATC
GACGGCACGC ACGCGAGCTT CGCCGGCTGC ACGCTCGACG CCGCCGATTT CACGTCGGCG
CGCCTGCCGC GCGCGAATTT TCAGCATGCG ACGCTCGCGG ACGCGGCGCT CGCCTGCGCG
CACTGCGACG GCGCCGAATG GTACGGCGCG CAGGCGCCGC GTGCCCGGCT TCGCGCGGCG
TCGCTGCGCG GCTCGCGCGC GGACGCGTCG ACGTCGTTCC GGCAAGCCGA TCTGAGCAGC
GCCGCGCTCG ACGACGCGAA CTGGGACGGC GTCGACCTGC GCGGCACGAA CCTGCACGAG
GCGACGCTCG ACGGCGCGAG CCTCGCGCGC GCGAACGCGA GCGGCGCGCA ACTGACGCGC
GCGCGCGCAC GGCGCGCGGA TCTGACCCAG GCCGACCTCA CGCACGCGGA TGCGCGCTGC
TCGAACCTGC ACGGCGCATC GCTGCGCCGC GCACGGCTCG GCGGCACGCA ACTGCAATCG
AGCAACCTGT ACGGCGCCGA CTGCTACGGC ACCGCGCTCG CCCGGCCGCA GCTCGACGGC
GCGAATATCG AGCGCACGCT CCTCGCCGTG CCGGGCCGCC CCGAACTCGC CGCCTCCCGC
TGA
 
Protein sequence
MKIVKPETLA LMCRTLRIER ADRLSIGALA CFALRAHAPD GPGDLAPEAT LWQIAQQWLG 
AHAPLDEGWP KPAGEFLVYG DACAPAGREH AGGAPFAVRA RIGAACKARL VDARDPAERV
LADFRALPPS HPQRVRDLGP FDARWLAERW PHLPSGTRAE HFHTAPRDQR IAGFWRGDED
IELVNLHAQH PIVAGALPRV RARCFVERSA GGATRVDACP MRAETVWLFP GAACGIVLYR
GLATIDDEDG DDVLRVIAGW EDAAAPPLPA DAYLGRPASG GAGSRPTPAP DAAPVASVVP
AALTAEEAHA DEHAPGGSAS ASQAHSPAAP EFPEAPHAPD LSALEREAAA LAGQTDALLA
GLGITEADIA RLLPARDAPA ELNLDQLATL AAELDAQTAQ WQAQQAAAAV ERGDAASATP
AAPATPDAEA AHEASLADLL RQADAQMRAL VEQHGLSRAR MEAAARTLPE LAPLAGSLDA
LDALDAPLDV DALTAGLAAA GGDAAAEPDT PAEPSPPAPA NEFAAAAPAP ASSTAAPPVD
DAPPGPLTRE QVIERHARGL GFAGLDLSGL DLSSAALERA DFRRARLERT RFAGCRLAGA
SFERALLSHA DFSNADLRDA VFAGASAPGA SWRGAVLERA RLEHGDFSGG DFAQASLADS
HCAHAQFDAS AMTALVAARI DGTHASFAGC TLDAADFTSA RLPRANFQHA TLADAALACA
HCDGAEWYGA QAPRARLRAA SLRGSRADAS TSFRQADLSS AALDDANWDG VDLRGTNLHE
ATLDGASLAR ANASGAQLTR ARARRADLTQ ADLTHADARC SNLHGASLRR ARLGGTQLQS
SNLYGADCYG TALARPQLDG ANIERTLLAV PGRPELAASR