Gene Bpro_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1047 
Symbol 
ID4012265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1073749 
End bp1074717 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content63% 
IMG OID637940725 
Productextracellular solute-binding protein 
Protein accessionYP_547898 
Protein GI91786946 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.338181 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.286195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAGA CAACCGTCAA GAAATTGGCT TCGCTGCTGG CGGCAGGCGC TTGCGCGGCC 
GGCATGGCCT CAGCCGCCGG GGCACAGGAA ACCAAGCTGA CGCTGGGCAT GTCTGGCTGG
ACCGGCTTCG CCCCGCTGTC GCTGGCGGAC AAGGCCGGCA TCTTCAAGAA AAACGGCCTG
GATGTGGAGA TCAAGTTCAT TCCGCAGAAA GACCGCCACC TGGCGCTGGC CGCCGGGGCC
ATCCAGTGCG CAGCCACCAC CGTGGAAACC CATGTGGCCT GGAACGCCAA TGGCGTGCCC
ATCGTGCAGA TCTTCCAGAT GGACAAATCC TATGGTGCCG ACGGCCTGGC CGTGCGTAAC
GACATCAAGA GCTTTGCCGA CCTGAAGGGC AAGACCATCG GCGTGGACGC CCCGGGCACC
GCGCCCTACT TCGGCCTGGC GTGGATGCTG AACAAAAACG GCATGACGCT CAAGGACGTG
AAGACCACCA CCCTCTCGCC GCAGGCGTCT GCCCAGGCCT TTGTCGCGGG CCAGAACGAC
GCGGCCATGA CCTACGAGCC CTACCTCTCC ACCGTGCGCG ACAACCCGGC CTCCGGCAAG
ATCCTGGCCA CCACGCTGGA CTACCCCATG GTGATGGACA CGGTCGGCTG CGCGCCGACC
TGGCTCAAGG CCAATCCCAA GGCTGCCCAG GCGCTCACCA ACTCCTACTT CGAGGCGCTG
GCCATGATCA AGGCCGACCC CGTCAAGTCC AATGAATTGA TGGGCTCGGC CGTCAAGCAG
ACCGGCGAGC AGTTTGCCAA GTCGGCAGCC TACCTGCGCT GGCAGGACAA GGCGGCCAAC
CAGAAGTTCT TCGCCGGCGA GATCACCGCG TTCATGAAAG ACGCCGAAAA GATCCTGCTG
GAGTCCGGCG TGATCCGCAA GGCGCCCGAG AACCTCGCGG CAACGTTTGA CACCAGCTTC
ATCAAGTAA
 
Protein sequence
MGKTTVKKLA SLLAAGACAA GMASAAGAQE TKLTLGMSGW TGFAPLSLAD KAGIFKKNGL 
DVEIKFIPQK DRHLALAAGA IQCAATTVET HVAWNANGVP IVQIFQMDKS YGADGLAVRN
DIKSFADLKG KTIGVDAPGT APYFGLAWML NKNGMTLKDV KTTTLSPQAS AQAFVAGQND
AAMTYEPYLS TVRDNPASGK ILATTLDYPM VMDTVGCAPT WLKANPKAAQ ALTNSYFEAL
AMIKADPVKS NELMGSAVKQ TGEQFAKSAA YLRWQDKAAN QKFFAGEITA FMKDAEKILL
ESGVIRKAPE NLAATFDTSF IK