Gene Bphyt_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphyt_4038 
Symbol 
ID6278854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phytofirmans PsJN 
KingdomBacteria 
Replicon accessionNC_010676 
Strand
Start bp53349 
End bp54419 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID642615141 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_001887794 
Protein GI187918763 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAAA ACAACGGAGA CATCGTCACC ATGCCTCATG CTGTGACGCG CGTTGCGCGC 
GCCCTCCATG CCGCCGTCGC GGCGTCGCTC ACGATCTTCC TGCTCGCGCA GCCGGCTGCC
CCCGCGCGGG CCGACTCGCC CGAAAAAATC GTCATCATGG TGGGCGGTAT TACCAAGCTC
ATCTATCTGC CCGCACGCCT GACCGAACAG CTTGGGTACT TCAAGGCGGA AGGACTCGAC
GTCGAACTGC AATCGCAGCC GGCGGGCGTC GATGCGGAAA ACGAACTGCT CGCGGGCGGC
GTGCAGGCCG TAGTCGGCTT CTACGATCAC GCGATCGATC TGCAAGCCAA GGGTAAGGAA
ATCAAGGCGA TCGTCGTGTT TGGCCAGGTG CCCGGCGAAG TGGAGATGGT CGCGGCCAGG
GCGGCCGGTT CGATCAGAAG CATGGCCGAC GTGAAGGGTA AAACGCTCGG CGTGACGGGG
CTCGGTTCTT CGACCAACTT TCTCACGCAG TACCTCGCCA GCCTCAAAGG CGTGCCGCGT
TCACAGTACA CGGTGCTGCC GGTGGGCGCG GACAACAGCT TTATCGCGGC GATCCGGCAA
GGGCGTATCG ATGCGGGCAT GACTACCGAG CCGACTGTCT CGCAACTGCT GAAATCCGGC
GACGCCAGGG TACTGGTGGA CATGCGTAAT GTCGAAGGTA CACGCGCCGC GCTCGGTGGA
ACTTATCCGG CCTCAAGCCT GTATGTGCAG AGCGCGTGGC TCGACACGCA CCCGCAAGAG
GCGGCCAAAC TGGCGCGCGC ATTGGTGAAG ACGTTGCGAT ATCTGAATAC ACATAGCGCC
GAAGAGATCG CCGCGCAGAT GCCGAAAGAC TACATCGGCA ATGACGAGGC GCTTTATGTG
AGCGCGTTGA AGGCCTCGCT GCCGATGTTC ACCGCCGACG GCAAGATGCC CGCCGACGGG
CCGGAAACGG TGCTCAAGGT GCTGGCGGGT TTCAACCCTT CGGTGAAGGG TCGTCATATC
GATCTGTCGA GAACCTTCAC CAATCAGTTC GTCAATGAAG TGAAACCGTA G
 
Protein sequence
MRKNNGDIVT MPHAVTRVAR ALHAAVAASL TIFLLAQPAA PARADSPEKI VIMVGGITKL 
IYLPARLTEQ LGYFKAEGLD VELQSQPAGV DAENELLAGG VQAVVGFYDH AIDLQAKGKE
IKAIVVFGQV PGEVEMVAAR AAGSIRSMAD VKGKTLGVTG LGSSTNFLTQ YLASLKGVPR
SQYTVLPVGA DNSFIAAIRQ GRIDAGMTTE PTVSQLLKSG DARVLVDMRN VEGTRAALGG
TYPASSLYVQ SAWLDTHPQE AAKLARALVK TLRYLNTHSA EEIAAQMPKD YIGNDEALYV
SALKASLPMF TADGKMPADG PETVLKVLAG FNPSVKGRHI DLSRTFTNQF VNEVKP