Gene Bpro_5333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_5333 
Symbol 
ID4016292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007950 
Strand
Start bp93113 
End bp94414 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content64% 
IMG OID637944955 
Producthomoserine dehydrogenase 
Protein accessionYP_552087 
Protein GI91791137 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.767379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTAG GGATATTAGG TTTTGGCAAC GTTGCCGCAG CAACGATCGA GGCCTTCACC 
ACCAATCAGG ATCTCATCCG ATCCAAGACG GAAACCCCCA TCGAGTTCGT GCGGGTTGCC
ACGCGCACGC CTTCACGCGC GCAGGGGCGC GTGCCTGGCG GCTGCGTCGT GTCCGATGAC
TGCTGGGCAG TGGTGGACGA TCCTCAGATC GACGTGGTCC TCGAACTCAT GGGCAACGTG
AAGCTGGGGC GTGAGCTCGT ACTCCGGGCC TTGTCCAACA GCAAGCACGT CATCACGGCC
AACAAGGCAC TCCTGGCCCA ACACGGCGAG GAGATCATGC AGATGGCGGA TCAAGCCGGG
TGCAGCGTGC TCTTCGAGGG TGCCGTGGCT GTTTCGATCC CCATCATCAA AACGCTCCGC
GAGTCGGCGG CCGCCAACCG GATCTCGTCC ATCATCGGCA TCCTGAACGG CACGTCGAAC
TACGTGCTGT CCCAGATGAG CGAGCACGGC GTCGACTTCG CGACCGCTGT CGCCGACGCG
CAGCGCAAGG GCTACGCGGA AGCAGACCCC ACGCTGGACG TGAATGGCGA GGATGCGGCG
CACAAGCTCA CGCTCCTGGC TTCGCTCGCC TTCGACGTAC CCATCAACTT CAGTGCGGTC
GAGTTCAAGG GTATTGCGGA GATCGACCAG GTCGACATCG AATTCGCCAA GCGACTCGCC
CACCAGGTCA AACTGATCGC GCAGGCCAGA CTGGAGTCGG GTGGCATAGC CATCAGCGTG
CAGCCCACCC TGGTGCCCGA CCGGTCGATG CTGGCGCGGG TCGGCGGCTC AATGAACGGC
ATCTCCCTGC AGGGTGACCT GCTGGGTCCG GCCTTCCTCT ACGGCTCGGG TGCCGGGGGC
CGGCAGACCT CCAGTGCCGT GCTGGCCGAC CTGCTTGAGC TGGCCAATCG CGGCGGTCCG
AACGGTGCGG GCGGCGGCCA CAACATGGGG TTCCGGCCTC GTGGGACCGG GCAGACCGAG
GTCCGCTACT GCAACGATCG CGTCGGTGCG TTCTACATCC GGCTGCGCCT GGACGACAAG
GCCGGCGCGC TGGCCAAGGT CAGCACGGTC CTAGCTGATG CCAACGTGTC GATCAACTCG
CTGCTGCAAG ATCAGGGACT CGATGGGTTG TCGGACCTGA TCGTGATCAC GCACGACATC
TCTAGCGGCC AACTCCGAGA TATGTTGCCC TATCTCCAAC AGGCTGCAGG CCCTGGCCAC
ACGGTCGTGA TCTATCCAGT TCTGGGTGAC TGCGGATGCT GA
 
Protein sequence
MKVGILGFGN VAAATIEAFT TNQDLIRSKT ETPIEFVRVA TRTPSRAQGR VPGGCVVSDD 
CWAVVDDPQI DVVLELMGNV KLGRELVLRA LSNSKHVITA NKALLAQHGE EIMQMADQAG
CSVLFEGAVA VSIPIIKTLR ESAAANRISS IIGILNGTSN YVLSQMSEHG VDFATAVADA
QRKGYAEADP TLDVNGEDAA HKLTLLASLA FDVPINFSAV EFKGIAEIDQ VDIEFAKRLA
HQVKLIAQAR LESGGIAISV QPTLVPDRSM LARVGGSMNG ISLQGDLLGP AFLYGSGAGG
RQTSSAVLAD LLELANRGGP NGAGGGHNMG FRPRGTGQTE VRYCNDRVGA FYIRLRLDDK
AGALAKVSTV LADANVSINS LLQDQGLDGL SDLIVITHDI SSGQLRDMLP YLQQAAGPGH
TVVIYPVLGD CGC