Gene BURPS1106A_A0347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0347 
Symbol 
ID4904419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp330452 
End bp332731 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content68% 
IMG OID640143454 
ProductTonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 
Protein accessionYP_001074390 
Protein GI126456849 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCGCGGC GGCCGCTTCG CGCCGCGCTG TTCGGGGCCT TCGGCCTCTA TGCGGCGGCC 
GCGCGCGCCG CCGGCCCCGC TTCCGAACCC GCGGCCGCCG CATCCGCCGC ATCCACGTCG
CAGGTGCGGC ACGCGGCGAT CGCCGCCGCA CGCAAGGACG CGCCGGCACT CGATCCGATC
ACCGTCACCG CGACGCGCAC CGCGTCGGCC GCGAGCCGCA CCGCGGCGAG CGTATCGGTG
ATCACCGATT CAGACCTCGA GGAGCAGCAG GCCGACAACA TCAAGGACGC GCTGCGCTAC
GAGCCGGGCG TCACCGTGCG ACGCACCGCG TACCGCCCGG CGAACGCCGC GCTCGGCGGC
GGCCGCGACG GCGATTCGAG CATCAACATC CGCGGCCTCG AAGGCAACCG CGTGCTGCTG
ATGGAAGACG GCATCCGGCT GCCGAGCGCG TTCTCGTTCG GCCCGCTCGA AGCCGGCCGC
GGCGATTACG CCGATCTCGA CACGCTCGCG CGCATCGAGA TCCTGCGCGG TCCGGCGTCC
GCGCTGTATG GCAGCGACGG CCTGACGGGC GCCGTCAACT TCATCACGAA AGATCCGTCC
GATCTGCTGT CGATCCATCG AAAAAAGACC TATTTCTCGT TCCGGCCGAG CTACGACTCG
GTCGACCGCA GCATCGGCGC GACCGTGACG GCGGCGGGCG GCAACGATCG TGTGCAGGCG
ATGCTGATCG CGTCCGGCCG CCGCGGCCAC GAACTCGACA CGCACGGCGA CGACAATTCC
GCGAGCACGC GGCGCACGCG CGCGAATCCT CAGGATGTCT ACACGGAATC GCTGCTCGGC
AAGCTGACGA TCACGCCGAC CGCGCGCGAC ACGATCAAGC TCGCCGCCGA AACGGTGCGG
CGGCGGATCG ACACGAACGT GCTGTCGGCG ATCAATCCGC CGACAACGCT CGGCCTCACC
GCGAACGACA GGCTCGAGCG CAACCGCTTC AGCATCGACT ACGATTTGCG CGACGCCACC
GCGCGCGGGT TCCAGACCGC GCACGTGCAG TTCTACTATC AGGAGTCGAC GCAGGATCAG
GACGCGTTCG AGACGCGCGG CGGGCGGCTC CAATCGCGTT CGCGCTCGAA CCACTACAGC
GAGCGCGCGC TCGGCGGCTC CGCGTTCGCC GAGAGCGGCT TCGCGACCGG GCCGCTCGCG
CACAAGCTGC TGTACGGCGT CGACGGCAGC ATCGACCGCA TCAAGAGCCT GCGCGAGGGC
ACCGTCGCGA GCCCCGGCGA ATCGTTCCCG AACAAGGCGT TTCCGGACAC CGACTACTCG
CTGTTCGGCG CGTTCGTGCA GGATCAGATC GGCTTCGGCA AGCTGCTCGT CACGCCGGGC
CTGCGCTTCG ACGCGTATCG GCTCAGCCCG AGCTCGGGCG ATCCGCTGTT CACCGGCAAG
ACGGTCAGCT CGAGCGATCA CGAGCTGTCG CCGCGCGTCG CGATGCTCTA TGAAGTGTCG
CCCGCGCTGA TTCCCTACGC GCAGTATGCG CACGGCTTTC GCACGCCGAC GCCCGATCAG
GTCAACAACA GCTTCTCGAA TCCGATCTAT GGCTATACAT CGATCGGCAA TCCGAACCTG
AAGCCCGAGA CGAGCGACAC GCTCGAAGCG GGCCTGCGCG GCACGCTCGG CACCGGCTAC
GGGCCGCTGC GCTACAGCGT CGCCGCGTTC GCCGGCCGCT ATCGCAACTT CATCTCGCAG
CGCACGGTGG GCGGCAGTGG CCGGCCGAAC GATCCGCTCG TGTTCCAGTA CGTGAACTTC
GCGAACGCGC GCATTCACGG CTTCGAGGGA CGCGCCGAAT GGGTGATGCC GAATGGCTTC
ACGCTGAAGA CGGCGATGGC GTTCACGAAG GGCACGACGC AGGACAACGG CGCGGCGAGC
GAGCCGCTCG ATACGGTCAA CCCGTTCTCC GCCGTGTTCG GCGTGCGCTA CGAGCCGAGC
GAGCGCTGGT TCGCGCAGGC GGACCTGCTG TGGCAGGCGG GCAAGCGCGG CCGCGACGTG
TCGTCGGCCG CGTGCCAGAA AAAGACCTGC TTCACGCCGC CGTCGTCGTT CGTCGTCGAT
CTGCGCGGCG GCTACCGCTT CAACAAGCAC GTGAGCGCCT ACCTCGGCAT TCACAACCTG
TTCGACCGCA AGTACTGGAA CTGGTCGGAC GTGCGCGGCA TCGCCGCCGA TTCGAACGTG
CTCGACGCAT ACACCGCCCC GGGCCGCAGC GTCGCGGTCA GCATGAAGGT GGATTTCTGA
 
Protein sequence
MARRPLRAAL FGAFGLYAAA ARAAGPASEP AAAASAASTS QVRHAAIAAA RKDAPALDPI 
TVTATRTASA ASRTAASVSV ITDSDLEEQQ ADNIKDALRY EPGVTVRRTA YRPANAALGG
GRDGDSSINI RGLEGNRVLL MEDGIRLPSA FSFGPLEAGR GDYADLDTLA RIEILRGPAS
ALYGSDGLTG AVNFITKDPS DLLSIHRKKT YFSFRPSYDS VDRSIGATVT AAGGNDRVQA
MLIASGRRGH ELDTHGDDNS ASTRRTRANP QDVYTESLLG KLTITPTARD TIKLAAETVR
RRIDTNVLSA INPPTTLGLT ANDRLERNRF SIDYDLRDAT ARGFQTAHVQ FYYQESTQDQ
DAFETRGGRL QSRSRSNHYS ERALGGSAFA ESGFATGPLA HKLLYGVDGS IDRIKSLREG
TVASPGESFP NKAFPDTDYS LFGAFVQDQI GFGKLLVTPG LRFDAYRLSP SSGDPLFTGK
TVSSSDHELS PRVAMLYEVS PALIPYAQYA HGFRTPTPDQ VNNSFSNPIY GYTSIGNPNL
KPETSDTLEA GLRGTLGTGY GPLRYSVAAF AGRYRNFISQ RTVGGSGRPN DPLVFQYVNF
ANARIHGFEG RAEWVMPNGF TLKTAMAFTK GTTQDNGAAS EPLDTVNPFS AVFGVRYEPS
ERWFAQADLL WQAGKRGRDV SSAACQKKTC FTPPSSFVVD LRGGYRFNKH VSAYLGIHNL
FDRKYWNWSD VRGIAADSNV LDAYTAPGRS VAVSMKVDF