Gene BURPS668_A0440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0440 
Symbol 
ID4887279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp403694 
End bp405988 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content68% 
IMG OID640130381 
ProductTonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 
Protein accessionYP_001061446 
Protein GI126444863 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01785] TonB-dependent heme/hemoglobin receptor family protein
[TIGR01786] TonB-dependent hemoglobin/transferrin/lactoferrin receptor family protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCGCGGC AGCCGCTTCG CGCCGCGCTG TTCGGGGCCT TCGGCCTCTA TGCGGCGGCC 
GCGCGCGCCG CCGGCCCCGC TTCCGAACCC GCGGCCGCCG CGCCGCCGGC CGCCGCATCC
GCCGCATCCA CGTCGCAGGT GCGGCACGCG GCGATCGCCG CCGCGCGCAA GGACGCGCCG
GCACTCGATC CGATCACCGT CACCGCGACG CGCACCGCGT CGGCCGCGAG CCGCACCGCG
GCGAGCGTAT CGGTGATCAC CGATTCAGAC CTCGAGGAGC AGCAGGCCGA CAACATCAAG
GACGCGCTGC GCTACGAGCC GGGCGTCACC GTGCGACGCA CCGCGTACCG TCCGGCGAAC
GCCGCGCTCG GCGGCGGCCG CGACGGCGAT TCGAGCATCA ACATCCGCGG CCTCGAAGGC
AACCGCGTGC TGCTGATGGA AGACGGCATC CGGCTGCCGA GCGCGTTCTC GTTCGGCCCG
CTCGAAGCCG GCCGCGGCGA TTACGCCGAT CTCGACACGC TCGCGCGCAT CGAGATCCTG
CGCGGTCCGG CGTCCGCGCT GTATGGCAGC GACGGTCTGA CGGGCGCCGT CAACTTCATC
ACGAAAGATC CGTCCGATCT GCTGTCGATC CATCGAAAAA AGACCTATTT CTCGTTCCGG
CCGAGCTACG ACTCGGTCGA CCGCAGCATC GGCGCGACCG TGACGGCGGC AGGCGGCAAC
GATCGCGTGC AGGCGATGCT GATCGCGTCC GGGCGCCGCG GCCACGAACT CGACACGCAC
GGCGACGACA ATTCCGCGAG CACGCGGCGC ACGCGCGCGA ATCCTCAGGA TGTCTACACG
GAATCGCTGC TCGGCAAGCT GACGATCACG CCGACCGCGC GCGACACGAT CAAGCTCGCC
GCCGAAACGG TGCGGCGGCG GATCGACACG AACGTGCTGT CGGCGATCAA TCCGCCGACA
ACGCTCGGCC TCACCGCGAA CGACAGGCTC GAGCGCAACC GCTTCAGTAT CGACTACGAT
TTGCGCGACG CCGCCGCGCG CGGGTTCCAG ACCGCGCACG TGCAGTTCTA CTATCAGGAG
TCGACGCAGG ATCAGGACGC GTTCGAGACG CGCGGCGGGC GGCTCCAATC GCGTTCGCGC
TCGAACCACT ACAGCGAGCG CGCGCTCGGC GGCTCCGCGT TCGCCGAGAG CGGCTTCGCG
ACCGGGCCGC TCGCGCACAA GCTGCTGTAC GGCGTCGACG GCAGCATCGA CCGCATCAAG
AGCCTGCGCG AGGGCACCGT CGCGAGCCCC GGCGAATCGT TCCCGAACAA GGCGTTTCCG
GACACCGACT ACTCGCTGTT CGGCGCGTTC GTGCAGGATC AGATCGGCTT CGGCAAGCTG
CTCGTCACGC CGGGCCTGCG CTTCGACGCG TATCGGCTCA GCCCGAGCTC GGGCGATCCG
CTGTTCACCG GCAAGACGGT CAGCTCGAGC GATCACGAGC TGTCGCCGCG CGTCGCGATG
CTCTATGAAG TGTCGCCCGC GCTGATTCCC TACGCGCAGT ATGCGCACGG CTTTCGCACG
CCGACGCCCG ATCAGGTCAA CAACAGCTTC TCGAATCCGA TCTATGGCTA TACATCGATC
GGCAATCCGA ACCTGAAGCC CGAGACGAGC GACACGCTCG AAGCGGGCCT GCGCGGCACG
CTCGGCACCG GCTACGGGCC GCTGCGCTAC AGCGTCGCCG CGTTCGCCGG CCGCTATCGC
AACTTCATCT CGCAGCGCAC GGTAGGCGGC AGTGGCCGGC CGAACGATCC GCTCGTGTTC
CAGTACGTGA ACTTCGCGAA CGCGCGCATT CACGGCTTCG AGGGACGCGC CGAATGGGTG
ATGCCGAATG GCTTCACGCT GAAGACGGCG ATGGCGTTCA CGAAGGGCAC GACGCAGGAC
AACGGCGCGG CGAGCGAGCC GCTCGATACG GTCAACCCGT TCTCCGCCGT GTTCGGCGTG
CGCTACGAGC CGAGCGAGCG CTGGTTCGCG CAGGCGGACC TGCTGTGGCA GGCGGGCAAG
CGCGGCCGCG ACGTGTCGTC GGCCGCGTGC CAGAAAAAGA CCTGCTTCAC GCCGCCGTCG
TCGTTCGTCG TCGATCTGCG CGGCGGCTAC CGCTTCAACA AGCACGTGAG CGCCTACCTC
GGCATTCACA ACCTGTTCGA CCGCAAGTAC TGGAACTGGT CGGACGTGCG CGGCATCGCC
GCCGATTCGA ACGTGCTCGA CGCATACACC GCCCCGGGCC GCAGCGTCGC GGTCAGCATG
AAGGTGGATT TCTGA
 
Protein sequence
MARQPLRAAL FGAFGLYAAA ARAAGPASEP AAAAPPAAAS AASTSQVRHA AIAAARKDAP 
ALDPITVTAT RTASAASRTA ASVSVITDSD LEEQQADNIK DALRYEPGVT VRRTAYRPAN
AALGGGRDGD SSINIRGLEG NRVLLMEDGI RLPSAFSFGP LEAGRGDYAD LDTLARIEIL
RGPASALYGS DGLTGAVNFI TKDPSDLLSI HRKKTYFSFR PSYDSVDRSI GATVTAAGGN
DRVQAMLIAS GRRGHELDTH GDDNSASTRR TRANPQDVYT ESLLGKLTIT PTARDTIKLA
AETVRRRIDT NVLSAINPPT TLGLTANDRL ERNRFSIDYD LRDAAARGFQ TAHVQFYYQE
STQDQDAFET RGGRLQSRSR SNHYSERALG GSAFAESGFA TGPLAHKLLY GVDGSIDRIK
SLREGTVASP GESFPNKAFP DTDYSLFGAF VQDQIGFGKL LVTPGLRFDA YRLSPSSGDP
LFTGKTVSSS DHELSPRVAM LYEVSPALIP YAQYAHGFRT PTPDQVNNSF SNPIYGYTSI
GNPNLKPETS DTLEAGLRGT LGTGYGPLRY SVAAFAGRYR NFISQRTVGG SGRPNDPLVF
QYVNFANARI HGFEGRAEWV MPNGFTLKTA MAFTKGTTQD NGAASEPLDT VNPFSAVFGV
RYEPSERWFA QADLLWQAGK RGRDVSSAAC QKKTCFTPPS SFVVDLRGGY RFNKHVSAYL
GIHNLFDRKY WNWSDVRGIA ADSNVLDAYT APGRSVAVSM KVDF