Gene Bphyt_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_2114 
Symbol 
ID6282834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010681 
Strand
Start bp2381250 
End bp2382869 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content66% 
IMG OID642621673 
ProductMammalian cell entry related domain protein 
Protein accessionYP_001895739 
Protein GI187924097 
COG category[R] General function prediction only 
COG ID[COG3008] Paraquat-inducible protein B 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCC CACAAGGACC CGCCCTGCCG CCCGATCTGC CCGATCCCGA TATCGTGCCG 
CGGCGCGGCT GGTTGCCCTC GCTCGTCTGG GTCGTGCCGC TGATCGCGGC GTTGATCGGT
CTCGCGCTGG TCGTCAGGGC GGTCACGGAG CGCGGCCCGG CAATCACCAT CGTCTTCGAC
AACGCCGAAG GCCTCGAACC CGGCAAGACC CAGGTCAAGT ACAAGGACGT CGAAATCGGT
TCGGTGAAGT CGATCACGCT GTCGAAGGAT CGCACGCACG TGCAGATCGC CGTGCAACTC
ACCAGGCAGG CAGAGAACTT CGCTGTCAAG GACACCCGCT TCTGGGTGGT GCGCCCTCGC
GTAGGCGCCG CCGGCGTGTC GGGCATCGGC ACACTGCTCT CGGGCGCGTA TATCGGCGTG
GATGTCGGCC GCTCGACGGA GACGCGAACC GAGTTTGTCG GGCTGGAGAC GCCGCCGCCC
ATCACCGCCG CCCAGAAAGG CCACCGCTTC ACGTTGCACG GCGATTCGCT CGGCTCGATC
GATATCGGCT CGCCGATTTT CTACCGGCGC GTGCAGGTGG GTCAGGTCTA CGGCATTTCG
CTCGACAAGG ACGGCACGGG CGTGACCATG CAGGTGTTCG TCGCCGCGCC GTACGATCAG
TACGTCGGCT CGAATTCGCG CTGGTGGCAT GCGAGCGGCG TGGACGTGCG GCTCGATTCG
ACCGGCTTTG TCGTCAACAC GCAGTCGCTT GCGGCGATTC TGGTCGGCGG GCTCGCCTTC
CAGACGCCGC CAGGTCAGCC GATGGGCACG CCGGCCGCGG AGAAAACCGA CTTCCGGCTC
GCCGCCGACG AAGTGGACGC CATGCGCGCG CCAGACGGCA TACGGGTACG CACCGTGATG
GTCTTCAGTC AGTCGCTGCG CGGACTGTCG GTGGGCGCGA CGGTCGACTT CCGGGGCATC
GTGCTGGGCC AGGTCACGGA CATCGGCGTC GAATACGATC CGCAAGCGCG CAGCTTCGTC
ATGCCGGTGA CGCTGGATCT GTACCCTGAC CGCCTGCGCC GGCGGTCCCG CGGCGCGGCC
ATGCCCGAGG CGGGTACCGC GGCCAGCCAC GAACTGTTGC GGCGTCTCGT CGAGCGCGGC
TTGCGTGGGC AATTGCGCAC CGGCAACCTG CTGACGGGCC AGTTGTACAT CGCGCTCGAC
ATTTTCCCCA ACGCCGCGCC CGTCAAGTTC GACACCACTA ACGAGCCGAT CCAGCTGCCG
ACCATTCCAA ACACGCTCGA CGCGTTGCAA ACGCAGGTGG CCGACATCGC GAAGAAGCTC
GACCGGATTC CGTTCGATCA GCTCGGTTCG AATCTGAACA CGTCGCTTAA AAACGCCGAC
GCGCTGTTCA ACCGGCTCAA CAACGAAGTC GTGCCGCAGG CGCGCGACAC GCTCGCTGCC
GCGCGGCAAA CCTTCGGCTC GGCCGAGGCG ACTTTGCAAC AGGACTCGCC GTTGCAGTCC
GACGTGCATC AGGCGCTGCA GGAGTTGACC CGCACGCTAC GATCGCTGAA CGCGCTAGCC
GATTATCTGG AGCGCCATCC GGAGTCGCTG GTGCGCGGCA AACCGGGAGA CAAGCCATGA
 
Protein sequence
MSSPQGPALP PDLPDPDIVP RRGWLPSLVW VVPLIAALIG LALVVRAVTE RGPAITIVFD 
NAEGLEPGKT QVKYKDVEIG SVKSITLSKD RTHVQIAVQL TRQAENFAVK DTRFWVVRPR
VGAAGVSGIG TLLSGAYIGV DVGRSTETRT EFVGLETPPP ITAAQKGHRF TLHGDSLGSI
DIGSPIFYRR VQVGQVYGIS LDKDGTGVTM QVFVAAPYDQ YVGSNSRWWH ASGVDVRLDS
TGFVVNTQSL AAILVGGLAF QTPPGQPMGT PAAEKTDFRL AADEVDAMRA PDGIRVRTVM
VFSQSLRGLS VGATVDFRGI VLGQVTDIGV EYDPQARSFV MPVTLDLYPD RLRRRSRGAA
MPEAGTAASH ELLRRLVERG LRGQLRTGNL LTGQLYIALD IFPNAAPVKF DTTNEPIQLP
TIPNTLDALQ TQVADIAKKL DRIPFDQLGS NLNTSLKNAD ALFNRLNNEV VPQARDTLAA
ARQTFGSAEA TLQQDSPLQS DVHQALQELT RTLRSLNALA DYLERHPESL VRGKPGDKP