Gene Bphy_3403 details

Gene Information

Locus tag: Bphy_3403
Symbol:
ID: 6244964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 335679
End bp: 337427
Gene Length: 1749 bp
Protein Length: 582 aa
Translation table: 11
GC content: 63%
IMG OID: 642595191
Product: triple helix repeat-containing collagen
Protein accession: YP_001859603
Protein GI: 186472261
COG category:
COG ID:
TIGRFAM ID:
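
The length fields in this record are consistent with its coordinates, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(335679, 337427)
print(length)                  # 1749
print(protein_length(length))  # 582
```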


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0000154784
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCACGC CGACCGTCGA GATCATCGAT GTCGTGGCGA TCGACGCCGA ACGCATGACG 
GTCGCCGTTA CCGTCGATAT CGAGCCCGAG GAAGTCGACA TCACTGACGT GGAAGCGGCG
AATGCCATCC TGATCGAGAT CATCGAGCAG TCCAACCGTT CAGGGCCGCC CGGCCCGATA
GGTCCGCAGG GTCCGGCCGG TCCGCAAGGT GTCCAGGGCA TTGCTGGCAA CGATGGCAAG
GATGGCGCGA CCGGCCCCGC CGGCGCGAGA GGTCCTGCAG GCCCGCAAGG CGACGCGGGT
CCGCAGGGTG TCGCCGGTCC TGCTGGCACA ACGGGTCCGC AGGGTGCGAC GGGGCCGGCC
GGACCCAGGG GCGATCCCGG CGTGGCCGGT CCGCAAGGCG CACAGGGTCT CAAGGGCGAT
CAGGGTCCGA AAGGCGATGC AGGTCCGCAA GGTCCGGCCG GTGCGACGGG CTCGCAGGGT
CCGGCCGGTG CGATGGGTCT CACGGGTCCG CAAGGAGCGA AAGGCGATCC CGGTCCCGTG
GGTCCGATGG GTCCCGAAGG CGACACGGGT CCGGCTGGCG ATCAGGGTCC GGAGGGAGAC
ACAGGCGCGC AGGGTCCGCA GGGTGCAACG GGTTCACAGG GACCCGCTGG TCCGGTCGGG
CAGACTGGCC CGCAGGGTGT CAAGGGCGAC ACCGGCGCGA CCGGCGAGCA GGGTCCGAAA
GGCGACACGG GTGCGCAAGG TCCGCAAGGC CCGGTTGGTC CAGCCGCCGA TACTTCGACC
TTCGTGCAGA AGTCCGGCGA CACGATGACG GGGCAATTGC GGATTGCGCC GGCGTCGGGT
GACGCGTCTC TATTGCTCAA TGCGCAGGGT TCAAACCTGC CCCGTATCTA TTTTCAGAAA
GGCGGGGCTG GCCCCAGCAT GCTCTACGAT GGTACGTCGA TAGGGTTTGC AAACGCCACG
CTCAACGCGT GGAATATGGC TATTAACGAC TCTGGCTCTG TCTCGTTTCG CAACACGGTC
AATATCAACA ACGGAACCGA CCTCTGGTTA AGCGCACAGT CCGGCAGCTA TAGCGGCAGG
CTCATGCTAA ACGCGAACGG CTACGCGCCG TTCCTTCGAT CCAACGGGGC CAACGGCAAT
ATTGAAGTTG TCAATTCGGC CAACAGCGCC GTTAACTTTA CGATCTGGGA TGACGGGCAC
GTAACCGCAC GCGGTAACGT CTACGCGTCG AGTGCTTTCC TGCAAACCAA TGGCGACGTT
AACGGCTCGC AATGGGTAGG CGGCTGGTTA AGTGCAGGCA TTAACAATAT CTGCATGACG
CGCACCACGG GCAACACGTA CAAAATCGGT TGGGATGGCG GCGCAGGCAA CCTCAACTTC
TATGTCGACG GCACATTTAT AGCGTATCTG CACAGCAACG CCTCGGACTC CGCGCTTAAA
GCCAACGTCG AGGAAGTTAC GCCCGACTCG CTTACCCTCA TTGAACAGAT AGACTTTTGC
AGTTACGACA TTGCCGGGCG GCATGTCGAC ATGGGCATCA TTGCGCAGCA GCTACAGACC
ATCACGCCGC GTTGGGCGTA TAAACCGCCG ACGCCTGATA CGCCCGAACC GTACTACGGC
GACCCGGAAC CGCAACCCTA TACCCCTTCG CCTCTGGCGT ATGACCGCGA CGCGCTGCTG
TTCGATGCGT TGCGCGCCGT GCAGCAGCTA TCGGCACGCG TGACACAGCT TGAGGCGATG
CGGCCATGA
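
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of that clean-up and of a GC fraction calculation, run only on the first two blocks of the sequence. The helper names are hypothetical.

```python
def join_blocks(blocked_sequence: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked_sequence.split()).upper()


def gc_fraction(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a block-formatted DNA sequence."""
    dna = join_blocks(blocked_sequence)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two 10-base blocks of the gene sequence above.
sample = "ATGAGCACGC CGACCGTCGA"
print(join_blocks(sample))           # ATGAGCACGCCGACCGTCGA
print(f"{gc_fraction(sample):.2f}")  # 0.65
```

Passing the full gene sequence to gc_fraction gives a value that can be compared with the GC content field of the record.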
 
Protein sequence
MSTPTVEIID VVAIDAERMT VAVTVDIEPE EVDITDVEAA NAILIEIIEQ SNRSGPPGPI 
GPQGPAGPQG VQGIAGNDGK DGATGPAGAR GPAGPQGDAG PQGVAGPAGT TGPQGATGPA
GPRGDPGVAG PQGAQGLKGD QGPKGDAGPQ GPAGATGSQG PAGAMGLTGP QGAKGDPGPV
GPMGPEGDTG PAGDQGPEGD TGAQGPQGAT GSQGPAGPVG QTGPQGVKGD TGATGEQGPK
GDTGAQGPQG PVGPAADTST FVQKSGDTMT GQLRIAPASG DASLLLNAQG SNLPRIYFQK
GGAGPSMLYD GTSIGFANAT LNAWNMAIND SGSVSFRNTV NINNGTDLWL SAQSGSYSGR
LMLNANGYAP FLRSNGANGN IEVVNSANSA VNFTIWDDGH VTARGNVYAS SAFLQTNGDV
NGSQWVGGWL SAGINNICMT RTTGNTYKIG WDGGAGNLNF YVDGTFIAYL HSNASDSALK
ANVEEVTPDS LTLIEQIDFC SYDIAGRHVD MGIIAQQLQT ITPRWAYKPP TPDTPEPYYG
DPEPQPYTPS PLAYDRDALL FDALRAVQQL SARVTQLEAM RP
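
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style string; translation table 11 uses the same codon to amino acid assignments as the standard code and differs only in its permitted start codons. The function name is illustrative, and the sketch assumes the input starts in frame and contains only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate block-formatted DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGAGCACGC CGACCGTCGA GATCATCGAT"))  # MSTPTVEIID
print(translate("CGGCCATGA"))                         # RP
```

Both outputs match the first block and the last two residues of the protein sequence listed above.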