Gene BURPS668_A2507 details


Gene Information

Locus tag: BURPS668_A2507
Symbol:
ID: 4885982
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2421456
End bp: 2423693
Gene Length: 2238 bp
Protein Length: 745 aa
Translation table: 11
GC content: 69%
IMG OID: 640132444
Product: TonB-dependent copper receptor
Protein accession: YP_001063501
Protein GI: 126443902
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01778] TonB-dependent copper receptor
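
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(2421456, 2423693)
    print(length)                  # 2238
    print(protein_length(length))  # 745
```

With the values from this record, the coordinates span 2238 bp, which is 746 codons, or 745 residues once the stop codon is excluded.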


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACATCCA CATTCCTGCG TCACGCGCCC GCCGCGCGTG GAGACCACGC GCGCCGCCGG 
CGGCGCGCGA TCACGCTTAC CGTTCCCGCG CTCGCCGCGG GCGCCTTTCA CCTCGCGCCG
GCCGTCGCGC AGACGAGCGA GGCCGTGCAC GGCCACGGCA CGCTCGGCGC GTCCGGCGTA
GCGAGCCGCG CCGAGACCGA CGCGACGAGC GCCAAATCGG ACGGCGCGGT TCGCGAAGCG
GCGAGCCGCA CGACAACCGG CGCCGCGCCG GACGCCACGA TGCTGCCGAC GATCGAAATC
GTCGCGGCGC CCGAATCGAC GCCGCTCGTC GTCGTCACCG ATCCGAAGAC GCCGCGCCAG
CCGCTGCCCG CGAGCGACGG CGCCGATTAT CTGAAGACGA TTCCCGGCTT CGCGTCGATC
CGCAGCGGCG GCACGAACGG CGACCCGGTG CTGCGCGGGA TGTTCGGCTC GCGGCTGAAC
ATTCTCGCGA ACGGCATGCC GACGCTCGGT GCGTGTCCCG GCCGGATGGA CGCGCCGACG
TCGTACATCG CGCCCGAGAG CTACGACAAG GTGACGCTCG TCAAGGGGCC GCAGACCGTG
CTGTACGGGC CGGGCGCATC GGCGGGCACG GTGCTGTTCG AGCGCGTGAC GCCGCGCTTC
AAGACGCCGG GCATGCGCTT TGACGGCAGC GTCGTCGGCG GCTCGTTCGG GCGCAACGAT
CAGAACGTCG ACGTGACGGC CGGCACGCCC GACTTCTACG GGCGCGTGAG CGCGAACCAT
GCGCACTCGC AGGACTACGA GGACGGCAAC GGCCGCACGG TGCCGTCGCA ATGGGACAAG
TGGAACGCGG ATGCGGCGCT CGGCTGGACG CCCGACGACA ACACGCGGCT CGAGCTGACG
GCAGGCACGG GCGACGGCTA CGCGCGCTAT GCGGGCCGCG GAATGGACGG CGCGCATTTC
CGGCGCGAGA CGTTCGGTCT GAAGTTCGAC AAGAAGCACA TCGGCGACGT GCTCGATCGC
ATCGAGGCGC AGGTCTTCTA CAACGAAGCC GATCACGTGA TGGACAACTA CACGCTGCGG
ATGCCCGATC CGACGAGCAG CATGCCGATG CGCATGGCCT CCGAAGTGCG CCGCCGCACG
CTCGGCGCGC GCGTCGCGGC GACGCTGCGC TTGACCGACG CGTTCAAGCT CGTGACGGGC
GTCGATGCGC AGTCGAACCG CCTCGACTCG CGCTCGGCGA TGGGGATGCA GAACTACGGC
GACAAGCCGT GGAATCCGCA GGCGAACATG TGGAACGCGG GCGCGTTCGG CGAGCTGACC
TGGTATGCGA GCGATGCGTC GCGCGTGATC GGCGGCGCGC GGATCGACTA TGCGGCCGCG
CGCGACAAGC GCGCGACGAC GGGCGGCATG AAGATGAGCA TGCGCAATCC GACGTTCGAC
GATCTCCGCT CGCGCGTGCT GCCGAGCGGC TTCGTGCGCT ACGAGCGTGA TCTCGCGTCG
CTGCCCGTCA CGTGGTACGC GGGCATCGGC CATGCGCAGC GCTTTCCTGA TTACTGGGAG
CTGTTCTCCG CCAAGCGCGG CCCGAACGGT TCGATCAACG CGTTCTCCGC GATCAAGCCC
GAGAAGACGA CGCAGCTCGA CATCGGCGCG CAGTACAAGA GCGACAAGCT CGACGCCTGG
GTGTCCGCCT ATGCGGGCTA CGTGCAGGAC TTCATCCTGT TCGACTATGC GACGGGCCCG
ATGGGACAGA TCACGCGGGC GACGAACGTC AACGCGCAGA TCATGGGCGG TGAGGTGGGC
GCGTCGTGGC GTCCGCTCGC GCCGTGGCGC TTCGAAGGGT CGCTCGCGTA TGCGTGGGGG
CGCAACGTGC AAAGCGGTGC GCCGCTGCCG CAGATGCCGC CGCTCGAGGC ACGCTTCGGC
GTCGAGTACA CTCGCGGGCC GTGGTCGGCG GGCGGGCTGT GGCGGGTCGT TGCGCCGCAG
CATCGCTACG CGCTGAACGA GGGCAACGTC GTCGGCAAGG ACTTTGGTCC GAGCGCCGGT
TTCGGCGTGC TGTCGCTGCA CGCGCAGTAC CACGTGAGCA AGACGGTGCA GATCTCGGTC
GGCGTCGACA ACGTGCTCGA CAAGGCTTAT GCGGAGCACC TGAACCTCGC GGGCAACGCC
GGTTTCGGCT ATCCGGCGAA TCTGCCTGTC ACCGAACCCG GCCGCACCGC GTGGGTTCGT
TTGAGCACCA AGCTCTGA
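
The gene sequence is printed in blocks of ten bases separated by spaces and line breaks, so it has to be cleaned before any calculation. The sketch below is one way to do that and to compute a GC fraction. The record does not say how its GC content figure was derived, so the definition used here (G plus C over all bases) is an assumption, and the helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases, assuming GC = (G + C) / total bases."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a small example.
    example = "ATGACATCCA CATTCCTGCG"
    print(clean_sequence(example))  # ATGACATCCACATTCCTGCG
    print(gc_fraction(example))     # 0.5
```

Passing the full listing to `gc_fraction` gives a value that can be compared against the GC content field reported for the gene.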
 
Protein sequence
MTSTFLRHAP AARGDHARRR RRAITLTVPA LAAGAFHLAP AVAQTSEAVH GHGTLGASGV 
ASRAETDATS AKSDGAVREA ASRTTTGAAP DATMLPTIEI VAAPESTPLV VVTDPKTPRQ
PLPASDGADY LKTIPGFASI RSGGTNGDPV LRGMFGSRLN ILANGMPTLG ACPGRMDAPT
SYIAPESYDK VTLVKGPQTV LYGPGASAGT VLFERVTPRF KTPGMRFDGS VVGGSFGRND
QNVDVTAGTP DFYGRVSANH AHSQDYEDGN GRTVPSQWDK WNADAALGWT PDDNTRLELT
AGTGDGYARY AGRGMDGAHF RRETFGLKFD KKHIGDVLDR IEAQVFYNEA DHVMDNYTLR
MPDPTSSMPM RMASEVRRRT LGARVAATLR LTDAFKLVTG VDAQSNRLDS RSAMGMQNYG
DKPWNPQANM WNAGAFGELT WYASDASRVI GGARIDYAAA RDKRATTGGM KMSMRNPTFD
DLRSRVLPSG FVRYERDLAS LPVTWYAGIG HAQRFPDYWE LFSAKRGPNG SINAFSAIKP
EKTTQLDIGA QYKSDKLDAW VSAYAGYVQD FILFDYATGP MGQITRATNV NAQIMGGEVG
ASWRPLAPWR FEGSLAYAWG RNVQSGAPLP QMPPLEARFG VEYTRGPWSA GGLWRVVAPQ
HRYALNEGNV VGKDFGPSAG FGVLSLHAQY HVSKTVQISV GVDNVLDKAY AEHLNLAGNA
GFGYPANLPV TEPGRTAWVR LSTKL