Gene BURPS1710b_A2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2572 
Symbol 
ID3692461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp3091289 
End bp3094678 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content72% 
IMG OID637732826 
Productserine protease 
Protein accessionYP_337722 
Protein GI76818314 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02601] autotransporter-associated beta strand repeat 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGGC ACAAAAAACG GAAAACGATG AAGCGCAGCG GAGCGAAGCT GCTTGCGCCC 
GTGGTCGTCG CGGCCGCCGC CGCGGTCGCG GCCCGCCCCG GCTGGGCGCA GGCCGCGCCG
TACCCGGATC CGGGCCGGCG CGGCGATCCG GCGAGTTGGC GCACGCCGGA ATTCACGAAC
GCGTGGGGGC TCGGCGCGAT GCACGCCGAG TATGCATACG CGGCCGGCTA TACCGGCGCG
AACGTCGCGA TCGGCGTGCT GGACTCCGGC TACTACGCGC AGCATCCGGA ACTACCCGAC
AGCCGCTTCG TTCCGGTGAC GGCCGCGGGC GTGTCCGGCG TGCTGAACCC GAACAACAAC
AATCACGGCA CGCTGGTGAG CGGCGTCGTC GGCGGCGCGC GCGACGGTGT CGGCATGCAC
GGCGTCGCAC CCGACGCGAC GGTGTACGAA GGCAACACGA ACGCGACCGA CGGCTTCCGG
TTCGGCGTAT CGGATCCGAA GTTTCCCGCG TCGGATGCGA AGTATTTCGC CGAGGCCTAC
GATGCGCTCG CCGCAAAGGG CGTGCGGATC ATCAGCAACA GCTGGGGCTC GCAGCCGGCC
AACGAGAACT ACAGCACCCT GAACAAACTC ACCGATGCCT ACAAGCTGCA CGAGGCGGTG
CGCACGGCGA CCGGCCGGGG CACGTGGCTC GACGCGGCGG CGAAGGTGTC GCGCGACGGC
GTGATCAACA ACTTCAGCTC GGGCAACACC GGCTACGACA ACGCGAGCCT GCGCGGCGCG
TATGCGTACT TCCACCCCGA ACTCGAAGGG CACTGGATGA CGACGACGGG CTACGACCAG
TTGAGCGGCC AGGTCTACAA CCAATGCGGG ATCGCGAAGT GGTGGTGCGT GATGGCGCCC
ACGGGCGTGC CGTCGACGTC ATATTCGGGC GGCGCGGCGG CGCCGACCGG GGCGACCTAC
GCGAACTTCA ACGGCACGTC GGCCGCCGCG CCGCACGCGT CGGCGGCGCT CGCGCTGATC
ATGGAGCGCT TTCCGTACAT GACGAGCGAG CAGGCGCTGT CGGTGCTGTT CACGACCGCG
CAGAACATGG AGCCGGACCC GAGCCGGCCG GACTACACGA ACAACGGGCT GTTCTCGACC
GTGCATCCGG CGAAGCCCGG CGCGTCGGGC GTGCCGAACG CGTTCGGCGG CTGGGGGCTC
GTCGACCTGC GCCGGGCGAT GAACGGCCCG GGCCAACTGC TCGGCACGTT CAACGCGGCG
CTGCCTGCGG GCACCGCCGA CGTGTGGTCG AACGACATCT CCGACGTCGC GCTCGCCGCG
CGCAAGCGCG AGGACGACGC CGAGCACCGC GCGTGGCTCG ACACGCTGAG GACGAAGGGA
TGGGAGCACG GGCTGCCCGC CGGCGCGAGC GATGGCGACC GGATCGACTA TGCGCTCGGC
GTCGCGCGCG AAACCGCGTA TCAGGCGCGC GAGTATCAGG GCAGTCTCGT GAAATCGGGC
GGCGGCACGC TGACGCTCGC CGGCGCGAAC ACCTATCGCG GGCCCACGAC GGTCGACGGC
GGCGAATTGA GGATCGACGG CTCGATCGCC GCGCGCGCCG TCGTCAATCC GGCGGGCCGG
CTCACGGTGA ACGGCCGCGC GGCCGACATC GCGGTCAACG GCGGCGTGGC GACGATCGCC
GGGACGAGCG CGAACCTGTC GATCGACCGG CAGGGCCGGG CCGCCGTGAC AGGGACGACG
GCGGACGTGC GCGTCGCGAG CGGCTTTGCA TCGCTCGGCG GCACGAGCGG CAACGTCGCG
GTGGGGGCGC TCGGCGTCGC CGCGATCACG GGCCGCACGG CCGACGTGGC GGTCGACGGC
GGCCGCGCGT CGCTCGACGG CGCGAGCGGC AACGTCGCGG TCGGCAACGG CGGCGTCGTG
AGCGGCAGCG GCACGGTGCG CACGCTCACG GCGGCCGCGA ACGGCACGGT CGCGCCCGGC
CATTCGGTCG GTACGCTGAC GGTGTCGGGC GACGTGCGCT TCGCGCCGGG TTCGATCTAC
GCGGTCGAGG TGTCGCCGGG CGGCGCGGGC GACCGGATCG TCGCGGGCGG CCGCGCGCAA
ATCGACGGCG GCGCGTTGGC GCTCGCGCTC GAGAACACGC CGCCGCCGCT CACGCCCGAG
CAGTCGCGCT CGGTGCTCGG CCGTCGCTTC GAGATCCTGA ATGCGGCGGG CGGCGTGGCC
GGCCGTTTCG ATGCGCCGAG CGGCTATCTG TTCGTCAATC CGGTGCTCGC CTATGGCCCG
ACGACCGTGA GCCTCACGAT CGATCGTAAC GCGACGCCGT TTGCAAGCGT CGCGCGGACC
GCGAACGAGC GCGGCGTCGC CGATGCGCTC GAAACGGCGG ACCCGGGCAG CGCGGTTTAC
AACAGCGTGC TGTTCGCGGC GTCCGCGCAG GCGCCGCAGG CGACGCTCGC GCAACTGACG
GGCGAGATCT ATCCGGCCGC CTACGCGGCG CTCGTCAACG AAAGCCGGCA AGTGCGCGAA
GCGGCGCTCG AGCGCCTGTG GACGGCGCGC GGCGCGCCGG GCCGCGCCGG CGCCTGGGCG
CGGCTGCTCG GCGCGTGGGG CAGCGCGCGC GGCGGCGACG TGAACGGCTA CACGAGCTCG
ACGGGCGGCT TCCTCGCGGG CGCGGACGCG GCGCTGCTCG ACAGCGTGCG GGCGGGCGGC
TTCGCCGGCT ACAGCCACAC CGGCGTGAAC CTGAGGAATC AGCCGTCGTC CGCGTCGTTC
GACAGCTTCC ATCTCGGCGC ATACGCGGGG TGGCAGCCCG GCGCACTCGG CGTGCGAATC
GGCGCGGCGC ATGCGTGGCA TCGCGGCGGT GTCGATCGCG CGGTGCAATA TGGCGCGGTT
GCCGAGAACG AAACGACGGC GCTGCACGCG GAAACGACGC AGGTGTTCGG CGAGGCCGGC
TATCGGTTCG CGCTCGATGG CGCCGCGACG CTCGAGCCGT TCTTCGGCGT CGCGTATGTG
CATCTGAAGA ACCAGGGGAC GACGGAAACC GGCGGCGCGG CGGCGTTGCG CGTGCGGCAA
GGCAATCACG ACGTGACGTT CTCGACGCTC GGCGTGCGCG GCGAAACGCG GCTTGGCCTG
ACGTCGCGAC TGCAGTTGAC GCTGCAGGGC AGCGCGGGCT GGCAGCATGC GCTGACGGAC
GGGCAGCCGA GCGGCACGCT CGCGTTCGCG ACGGGGAGCG ACACGTTCAC CGTGTCGAGC
GTGCCGGTTG CGAAGGATGC GGCGGTGCTG AACGTGGGCG CCGGGCTCGA GCTCGGCAAG
AACGGATGGC TGCGCGTCGG CTATTCCGGC TCGCTCGCGA GCCGTCAGTC CGAGCACGCG
GTGCAAGGCA GCCTGCACTG GAAGTTCTGA
 
Protein sequence
MTRHKKRKTM KRSGAKLLAP VVVAAAAAVA ARPGWAQAAP YPDPGRRGDP ASWRTPEFTN 
AWGLGAMHAE YAYAAGYTGA NVAIGVLDSG YYAQHPELPD SRFVPVTAAG VSGVLNPNNN
NHGTLVSGVV GGARDGVGMH GVAPDATVYE GNTNATDGFR FGVSDPKFPA SDAKYFAEAY
DALAAKGVRI ISNSWGSQPA NENYSTLNKL TDAYKLHEAV RTATGRGTWL DAAAKVSRDG
VINNFSSGNT GYDNASLRGA YAYFHPELEG HWMTTTGYDQ LSGQVYNQCG IAKWWCVMAP
TGVPSTSYSG GAAAPTGATY ANFNGTSAAA PHASAALALI MERFPYMTSE QALSVLFTTA
QNMEPDPSRP DYTNNGLFST VHPAKPGASG VPNAFGGWGL VDLRRAMNGP GQLLGTFNAA
LPAGTADVWS NDISDVALAA RKREDDAEHR AWLDTLRTKG WEHGLPAGAS DGDRIDYALG
VARETAYQAR EYQGSLVKSG GGTLTLAGAN TYRGPTTVDG GELRIDGSIA ARAVVNPAGR
LTVNGRAADI AVNGGVATIA GTSANLSIDR QGRAAVTGTT ADVRVASGFA SLGGTSGNVA
VGALGVAAIT GRTADVAVDG GRASLDGASG NVAVGNGGVV SGSGTVRTLT AAANGTVAPG
HSVGTLTVSG DVRFAPGSIY AVEVSPGGAG DRIVAGGRAQ IDGGALALAL ENTPPPLTPE
QSRSVLGRRF EILNAAGGVA GRFDAPSGYL FVNPVLAYGP TTVSLTIDRN ATPFASVART
ANERGVADAL ETADPGSAVY NSVLFAASAQ APQATLAQLT GEIYPAAYAA LVNESRQVRE
AALERLWTAR GAPGRAGAWA RLLGAWGSAR GGDVNGYTSS TGGFLAGADA ALLDSVRAGG
FAGYSHTGVN LRNQPSSASF DSFHLGAYAG WQPGALGVRI GAAHAWHRGG VDRAVQYGAV
AENETTALHA ETTQVFGEAG YRFALDGAAT LEPFFGVAYV HLKNQGTTET GGAAALRVRQ
GNHDVTFSTL GVRGETRLGL TSRLQLTLQG SAGWQHALTD GQPSGTLAFA TGSDTFTVSS
VPVAKDAAVL NVGAGLELGK NGWLRVGYSG SLASRQSEHA VQGSLHWKF