Gene BURPS1710b_A1781 details


Gene Information

Locus tag: BURPS1710b_A1781
Symbol:
ID: 3694253
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 2169205
End bp: 2171514
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 68%
IMG OID: 637732035
Product: TonB-dependent heme/hemoglobin receptor family protein
Protein accession: YP_336938
Protein GI: 76817936
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
            [TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein
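
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only illustrative and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the final codon is a stop codon not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2169205
end_bp = 2171514
gene_length_bp = 2310
protein_length_aa = 769

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not appear in the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 2310 bp is 770 codons, of which 769 code for amino acids.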


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCATCAAC CTCTGTTGGC GCGGCGGCCG CTTCGCGCCG CGCTGTTCGG GGCCTTCGGC 
CTCTATGCGG CGGCCGCGCG CGCCGCCGGC CCCGCTTCCG AACCCGCGGC CGCCGCGCCG
CCGGCCGCCG CATCCGCCGC ATCCACGTCG CAGGTGCGGC ACGCGGCGAT CGCCGCCGCG
CGCAAGGACG CGCCGGCACT CGATCCGATC ACCGTCACCG CGACGCGCAC CGCGTCGGCC
GCGAGCCGCA CCGCGGCGAG CGTATCGGTG ATCACCGATT CAGACCTCGA GGAGCAGCAG
GCCGACAACA TCAAGGACGC GCTGCGCTAC GAGCCGGGCG TCACCGTGCG ACGCACCGCG
TACCGCCCGG CGAACGCCGC GCTCGGCGGC GGCCGCGACG GCGATTCGAG CATCAACATC
CGCGGCCTCG AAGGCAACCG CGTGCTGCTG ATGGAAGACG GCATCCGGCT GCCGAGCGCG
TTCTCGTTCG GCCCGCTCGA AGCCGGCCGC GGCGATTACG CCGATCTCGA CACGCTCGCG
CGCATCGAGA TCCTGCGCGG TCCGGCGTCC GCGCTGTATG GCAGCGACGG CCTGACGGGC
GCCGTCAACT TCATCACGAA AGATCCGTCC GATCTGCTGT CGATCCATCG AAAAAAGACC
TATTTCTCGT TCCGGCCGAG CTACGACTCG GTCGACCGCA GCATCGGCGC GACCGTGACG
GCGGCGGGCG GCAACGATCG TGTGCAGGCG ATGCTGATCG CGTCCGGCCG CCGCGGCCAC
GAACTCGACA CGCACGGCGA CGACAATTCC GCGAGCACGC GGCGCACGCG CGCGAATCCT
CAGGATGTCT ACACGGAATC GCTGCTCGGC AAGCTGACGA TCACGCCGAC CGCGCGCGAC
ACGATCAAGC TCGCCGCCGA AACGGTGCGG CGGCGGATCG ACACGAACGT GCTGTCGGCG
ATCAATCCGC CGACAACGCT CGGCCTCACC GCGAACGACA GGCTCGAGCG CAACCGCTTC
AGCATCGACT ACGATTTGCG CGACGCCACC GCGCGCGGGT TCCAGACCGC GCACGTGCAG
TTCTACTATC AGGAGTCGAC GCAGGATCAG GACGCGTTCG AGACGCGCGG CGGGCGGCTC
CAATCGCGTT CGCGCTCGAA CCACTACAGC GAGCGCGCGC TCGGCGGCTC CGCGTTCGCC
GAGAGCGGCT TCGCGACCGG GCCGCTCGCG CACAAGCTGC TGTACGGCGT CGACGGCAGC
ATCGACCGCA TCAAGAGCCT GCGCGAGGGC ACCGTCGCGA GCCCCGGCGA ATCGTTCCCG
AACAAGGCGT TTCCGGACAC CGACTACTCG CTGTTCGGCG CGTTCGTGCA GGATCAGATC
GGCTTCGGCA AGCTGCTCGT CACGCCGGGC CTGCGCTTCG ACGCGTATCG GCTCAGCCCG
AGCTCGGGCG ATCCGCTGTT CACCGGCAAG ACGGTCAGCT CGAGCGATCA CGAGCTGTCG
CCGCGCGTCG CGATGCTCTA TGAAGTGTCG CCCGCGCTGA TTCCCTACGC GCAGTATGCG
CACGGCTTTC GCACGCCGAC GCCCGATCAG GTCAACAACA GCTTCTCGAA TCCGATCTAT
GGCTATACAT CGATCGGCAA TCCGAACCTG AAGCCCGAGA CGAGCGACAC GCTCGAAGCG
GGCCTGCGCG GCACGCTCGG CACCGGCTAC GGGCCGCTGC GCTACAGCGT CGCCGCGTTC
GCCGGCCGCT ATCGCAACTT CATCTCGCAG CGCACGGTAG GCGGCAGTGG CCGGCCGAAC
GATCCGCTCG TGTTCCAGTA CGTGAACTTC GCGAACGCGC GCATTCACGG CTTCGAGGGA
CGCGCCGAAT GGGTGATGCC GAATGGCTTC ACGCTGAAGA CGGCGATGGC GTTCACGAAG
GGCACGACGC AGGACAACGG CGCGGCGAGC GAGCCGCTCG ATACGGTCAA CCCGTTCTCC
GCCGTGTTCG GCGTGCGCTA CGAGCCGAGC GAGCGCTGGT TCGCGCAGGC GGACCTGCTG
TGGCAGGCGG GCAAGCGCGG CCGCGACGTG TCGTCGGCCG CGTGCCAGAA AAAGACCTGC
TTCACGCCGC CGTCGTCGTT CGTCGTCGAT CTGCGCGGCG GCTACCGCTT CAACAAGCAC
GTGAGCGCCT ACCTCGGCAT TCACAACCTG TTCGACCGCA AGTACTGGAA CTGGTCGGAC
GTGCGCGGCA TCGCCGCCGA TTCGAACGTG CTCGACGCAT ACACCGCCCC GGGCCGCAGC
GTCGCGGTCA GCATGAAGGT GGATTTCTGA
 
Protein sequence
MHQPLLARRP LRAALFGAFG LYAAAARAAG PASEPAAAAP PAAASAASTS QVRHAAIAAA 
RKDAPALDPI TVTATRTASA ASRTAASVSV ITDSDLEEQQ ADNIKDALRY EPGVTVRRTA
YRPANAALGG GRDGDSSINI RGLEGNRVLL MEDGIRLPSA FSFGPLEAGR GDYADLDTLA
RIEILRGPAS ALYGSDGLTG AVNFITKDPS DLLSIHRKKT YFSFRPSYDS VDRSIGATVT
AAGGNDRVQA MLIASGRRGH ELDTHGDDNS ASTRRTRANP QDVYTESLLG KLTITPTARD
TIKLAAETVR RRIDTNVLSA INPPTTLGLT ANDRLERNRF SIDYDLRDAT ARGFQTAHVQ
FYYQESTQDQ DAFETRGGRL QSRSRSNHYS ERALGGSAFA ESGFATGPLA HKLLYGVDGS
IDRIKSLREG TVASPGESFP NKAFPDTDYS LFGAFVQDQI GFGKLLVTPG LRFDAYRLSP
SSGDPLFTGK TVSSSDHELS PRVAMLYEVS PALIPYAQYA HGFRTPTPDQ VNNSFSNPIY
GYTSIGNPNL KPETSDTLEA GLRGTLGTGY GPLRYSVAAF AGRYRNFISQ RTVGGSGRPN
DPLVFQYVNF ANARIHGFEG RAEWVMPNGF TLKTAMAFTK GTTQDNGAAS EPLDTVNPFS
AVFGVRYEPS ERWFAQADLL WQAGKRGRDV SSAACQKKTC FTPPSSFVVD LRGGYRFNKH
VSAYLGIHNL FDRKYWNWSD VRGIAADSNV LDAYTAPGRS VAVSMKVDF
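
The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG can act as a start codon and is read as methionine at the initiation position. The following sketch shows one way to reproduce the translation and a GC fraction from the space-separated sequence blocks. The function names are hypothetical, the codon table is built from the standard amino-acid assignments (which table 11 shares for internal codons), and the code simply assumes the first codon is a valid start codon rather than checking it.

```python
# Codon table in the usual T, C, A, G ordering (standard assignments, shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a valid start codon, so it is always
    emitted as 'M' (as happens for GTG at initiation under table 11).
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "GTGCATCAAC CTCTGTTGGC GCGGCGGCCG"
print(translate_cds(first_blocks))  # MHQPLLARRP
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.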