Gene Bphyt_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_4901 
Symbol 
ID6279615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp1025758 
End bp1027098 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID642615987 
ProductHipA N-terminal domain protein 
Protein accessionYP_001888635 
Protein GI187919604 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.069886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.125288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCGCC AAACTCACTC ACGCGCACTG TCGGTATGGG CCAACGGCGA GCGCGTCGGC 
GTCTGGCGCC TGCCCGCTCG CGGGCCCATG GAGCTCGCCT ACGATCCCGC GTGGGTCGCC
TCGCCGGCGG GACGGCCGCT GTCGCTGTCG CTGCCGTTCA CGCCGGGCAA TCTGGCGCAA
AAAGGCCCGC GCGTTCTCAA CTATTTCGAC AACCTGCTGC CCGACAGCGA GGCGATCCGA
AAGCGCATCG CCCAACGCTA CCAGACCGAG ACGCTCGATG CGTTCGATCT GCTGCAAGCC
ATCGGCCGCG ACTGCGTGGG CGCCGTCCAG CTGCTCGCCG AAGACGACGT GCCGCAAGGC
GTCGAGCAAA TCGAGGGCAC GCCGCTCACT GACAGCGAAA TCGAGACCAT GCTGGCGCGC
ACGGTCGGCA ACCCCGCGCT CGGCGCACCA GACCAGACGG ACGATTTCCG CATCTCGCTC
GCCGGCGCGC AGGAAAAAAC CGCGCTGCTG TGGCATGACG GCAAGTGGCA GCGGCCGCAT
GGCGCCACGC CCACCACGCA CATTTTCAAG CTGCCGCTCG GCCTTGTCGG CAACAAGCTC
GCCGACCTCA GCACCTCGGT CGAAAACGAG TGGCTCTGTC TGCGGATTCT GCGCGCCTAC
GGCCTGCCGG TCGCCAATAC GGAGATCATG ACGTTCGGCA AACAGCGGGT ATTGAGTGTC
GAGCGCTTCG ACCGGCAAAT GCATTCGAGC GGGCAATGGC TGCTGCGTCT GCCGCAGGAA
GACTTCTGCC AGGTGTACGG CGTGCCGTCG CATCGCAAAT ACGAAAACGA AGGCGGCCCT
GGTGTGCTCG ACCTCGCGCG AATTCTGCAG CAATCGGTCG AGGCGCGGCA GGACATCGAG
ACGCTGCTGG CGAGCCAGAT TCTGTTCTGG ATGCTGGCGG CGCCGGACGG CCACGCCAAG
AATTTCAGCA TCCGCCTGCT GGCGGGTGGC CACTACCGGC TCACACCGCT TTACGACGTG
ATGTCGATCT GGCCGGTGGA AGGCAGCGGC CCGAACCAGT GGTCATGGTT CAAGGCGCGG
CTCGCCATGG GCATGTGGTC GCGCAGCAAG CACGACGCGT TTCGCGACGT GCAGCGGCGG
CACTTCAACA CCATGGCGCT GAAGTGCTCG TACGGCGCGG ACGCGGAACC GCTGATCCAG
CGGTTGATCG AGCAGACTCC CGGCGTGATC GAGCGGGTCT CCGCGGAATT GCCCGAACGT
TTTCCGGCCA AGGTCGCCGA ACGGATTTTC AAAGGCCTGA AAAACTCGGC GGCGAAGCTC
GGCACGATGT CTGCTGGCTA G
 
Protein sequence
MGRQTHSRAL SVWANGERVG VWRLPARGPM ELAYDPAWVA SPAGRPLSLS LPFTPGNLAQ 
KGPRVLNYFD NLLPDSEAIR KRIAQRYQTE TLDAFDLLQA IGRDCVGAVQ LLAEDDVPQG
VEQIEGTPLT DSEIETMLAR TVGNPALGAP DQTDDFRISL AGAQEKTALL WHDGKWQRPH
GATPTTHIFK LPLGLVGNKL ADLSTSVENE WLCLRILRAY GLPVANTEIM TFGKQRVLSV
ERFDRQMHSS GQWLLRLPQE DFCQVYGVPS HRKYENEGGP GVLDLARILQ QSVEARQDIE
TLLASQILFW MLAAPDGHAK NFSIRLLAGG HYRLTPLYDV MSIWPVEGSG PNQWSWFKAR
LAMGMWSRSK HDAFRDVQRR HFNTMALKCS YGADAEPLIQ RLIEQTPGVI ERVSAELPER
FPAKVAERIF KGLKNSAAKL GTMSAG